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ON WARING’S PROBLEM FOR FOURTH POWERS 


By H. DavENPORT 


(Received April 21, 1939) 
Introduction 


The object of this paper is to give proofs of the following theorems: 

Every sufficiently large number is representable as the sum of 14 fourth powers, 
unless it is congruent to 15 or 16 (mod 16). 

Every sufficiently large number is representable as the sum of 15 fourth powers, 
unless it is of the form 16"k, where k has one of a finite number of values. 

Every sufficiently large number is representable as the sum of 16 fourth powers. 

The second and third of these theorems are immediate consequences of the 
first. It is well known that the third theorem is the best possible result of its 
kind, since 16" X 31 is not the sum of less than 16 fourth powers. 

These results are a considerable improvement on the theorem (proved simul- 
taneously by Estermann and by Heilbronn and myself) that every large number 
is the sum of 17 fourth powers. The improvement results from the use of a new 
method, of which I have given accounts elsewhere,’ for constructing different 
numbers which are the sums of s positive integral k** powers. The reader who is 
familiar with Waring’s problem will see from the proof of Lemma 1 exactly what 
the new idea is, and will probably be content to take the rest of the paper for 
granted. But for the sake of other readers, the whole proof has been presented 
in fairly complete detail. 

The only external sources referred to are: (1) Landau, Vorlesungen itiber 
Zahlentheorie (Leipzig, 1927), volume 1; (2) Davenport and Heilbronn, “On an 
exponential sum,’”’ Proc. London Math. Soc., 41 (1936), 449-453. 


The fundamental lemmas 


Throughout the paper, all small Latin letters (with or without suffixes) except 
¢, 1, f, denote positive integers, and ¢ denotes an arbitrarily small positive number. 
In the present section, the constants implied by the symbol O depend only on e. 

Lemma 1.” Suppose that P is a large positive integer, and that wu. < U2 < --- 
< uy < P** where 0 <u <4. Suppose also that 


0 U > prow, 





‘Proc. Royal Soc.; Acta Arithmetica. An account of an alternative method, due to 
Erdés, is in course of publication in these Annals. This method yields a result for four 
fourth powers which would allow one to prove the third of the theorems enunciated above, 
but not the first two. 

_ * The general form of the result will be found in the author’s paper ‘‘On sums of positive 
integral kt» powers,’’ Acta Arithmetica. 
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732 H. DAVENPORT 


Then the number of solutions of 


(2) ttm =y'+u;, 
subject to 

(3) P3292, “Po y@ 3P, 
1s 


O(P?U*?P** ). 
Proor. The number of solutions with x = y is obviously 
O(PU) = O(P?U’P****), 


by (1). Hence it suffices to consider solutions with y > x. Writing y = x + ¢ 
(2) becomes 


’ 


(4) 42°t + 62°? + 4e¢° + t+ u; = Mm. 
Plainly 

Piszit<m <P, 
whence 
(5) <i 


Denote by M, the number of solutions of (4), subject to (3), (5), and by M(t, us) 
the number of solutions for given values of t, u,. By Cauchy’s inequality, 


Mi= Miu) s (2 ty (2 Mt, ui) 


twp twp Luh 
< (P*U)(My)', 

where M; denotes the number of solutions of 

(6) 4nt+..- +f +u;=4¢%4+...-¢+¢ +47 =m, 


in which z’ is subject to the same inequality as x. The number of solutions with 


/ 


x’ = xis precisely M,. In the solutions with x’ > z, we write x’ = x +h. 
(6) implies 


(7) tt(122” + 122(t + th) + 40 + 6tt, + 447) + u; = u,;, 
and # is subject tot; S P. Hence 
M; < M, + 2M, = O(max (Mi, M2)), 


where M; denotes the number of solutions of (7). Thus 


(8) M, = O(P*U) + O(P*UM,)}. 





tn) 
iy; 
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We now repeat on (7) the argument which was applied to (4). Let M(t, t, u;) 
denote the number of solutions of (7) for given values of t,t, u;. By Cauchy’s 
inequality. 


Mz= >> MGt,u) s ( , 1)'( > Mt, hw)! 


9 sshiny tee 
< (P*"U)*(M;), 
where M; denotes the number of solutions of 
(9) ty(122" + ---) + uy = t(12e" + ---) + uj = u;. 


The number of solutions with x’ = z is precisely M;. In the solutions with 
z' > «we write x’ = x +%,. (9) implies 


(10) 12tte(2a + ¢ + th + te) = uj — Uj, 
and t; is subject to tz S P. Hence 
Mz S Mz + 2M; = O (max (Mz, M;)), 
where M; denotes the number of solutions of (10). Thus 
(11) M, = O(P*"U) + O(P*'UM;)!. 
Given u;, uj, the equation (10) allows only O(P*) possibilities for t, t; , t , 
2c + t+ t + te as factors of uj; — u;.° Hence 
(12) M; = O(P‘U’). 
By (8), (11), (12), 
M, = O(P*U) + O(P*U PU + PEE} 
= o(P**U) + O(P* tu"), 
Since u» < 3, 
Pty s PU = O(P'U'’P**). 
Also, by (1), 
putt (75/4 _ (petit [72 p- 0 A-w) He) 
= OP UP, 


This proves Lemma 1. 

Lemma 2. Suppose that s = 2, and that f is one of 0, 1, 2,---,8. For 
n> no(e), there exist at least n’* ‘ numbers less than n representable as the sum of s 
fourth powers and congruent to f (mod (16), where 


(3) y=t oo a ae — 3+ 18% 
2 9’ ¥3 28’ 74 412’ ’ Ys 4(3 + %-1) 


_—_— 


> Landau, Satz 261. 
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734 H. DAVENPORT 


Proor. (1)s = 2. Letf = fi + fo, where fi = Oorlandf,=Oorl. The 
number of pairs 2 , 22 satisfying 


a < (4n)', a < (3n)', =m = fi (mod 2), — tz = fe (mod 2) 


is greater than Cn', where C is a positive absolute constant. It is well known‘ 
that the number of representations of an integer m as zi + 22 is O(m‘). Hence 
there are at least 

Cn} 4—2€ 

- n 

O(n’) 7 
numbers less than n representable as the sum of 2 fourth powers and = fi + f; = 
f (mod 16). 

(2) s > 2. We assume that the assertion of the lemma is true for s — 1, 

with 4 S y.-1 < 1, and deduce that it is true for s where y, is given by the last 


formula of (13). 
Let f = fi + fe, where f; is one of 0, 1, --- ,s — 1, and fpisOorl. Let 





3(1 vx4 Ys-1) 
14 ae po AVERT on I 
(14) be gr Sei 
Since } < y.-1 < 1, we have 0 < » < 4. Let P = [£(3n)']. Let wm < 
us < +++ < w < P*** be the numbers less than P*** representable as the sum 


of s — 1 fourth powers and congruent to f; (mod 16). By hypothesis 
U> Pt 1s-1-¢ 


od  cadiee 
by (14). 
Let r(m) denote the number of representations of m as x* + u,, where P < 
x S 2P,and x = fe (mod 2). Plainly 


Dd r(m) = $PU. 


m 


Also >> r°(m) does not exceed the number of solutions of (2) subject to (3). 


Since the conditions of Lemma 1 are satisfied, 


> r(m) = O(P?U? P™***5), 


™ 


The number of numbers less than n representable as the sum of s fourth 
powers and congruent to f (mod 16) is 


: (do r(m))? Ws 
> + @ Bt oo 
ae 


m 


Ys—€ 
Ye eee 





‘ Landau, Satz 262. 
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where 
3 + 137.1 
(e406 +3) = ST 
aT a 
by (14). 
Notation 


Let N be the large integer, not congruent to 15 or 16 (mod 16), which is tobe 
represented as the sum of 14 fourth powers. 
There exists an integer f = 0, 1, 2, 3, or 4, such that 


(15) N — 2f = 1, 2, 3, 4, 5, or 6 (mod 16). 
Let 
N 3 
: r-((2)) 
and let 
ty y= =. 


Let u < Ue < +--+ < uy < P*** be the numbers less than P“** representable 
as the sum of 4 fourth powers and congruent to f (mod 16). By Lemma 2, 


= 331 
1 +3) 74~€ on. 


We observe that 
(19) (u + 3)y4 = 3(1 — yp). 


For any real a, let 


2P 


T (a) = >> e(az"), 


z=P 


Tub « ded, 


h=1 


where e(A) is an abbreviation for e'"**. Let 


T° (a) U*(a) = > ri(m)e(ma). 


In order to prove the main theorem, it suffices to prove that ru(N) > 0. For 
any integer £, we have 


(20) | ela) da = : aE bs 





: CO Cr ES ee TT 
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where the range of integration is any interval of length 1. Hence 
(21) ru(N) = | T*a)U*(a)e(—Na) dee 


Throughout the paper, a, q are subject to a S q, (a, q) = 1. Let 


Saq = > eq(az'), (cata) - -(4)). 


z=1 


For any real 8, let 
(2P)4 


1(8) = Dy tn *e(6n), 
and let 


T*(a,a,9q) = q" 8.41 (c ~ *), 


5 denotes a small positive number, fixed throughout the paper. Except in 
the next section, the constants implied by the symbol O depend only on 4, «. 
C, , C2 , Cs denote positive absolute constants. 


Approximations to 7T'(a) 


The essence of the Hardy-Littlewood method consists in approximating to 
T(a) by T*(a, a, q) when a lies in a certain interval surrounding the rational 
point a/q, and in using an upper bound for 7'(a) (such as that provided by 
Weyl’s inequality) when a does not lie in any of these intervals. This apparatus 
of approximations and upper bounds is no simpler in the case of the exponent 4 
than in the case of the general exponent. Consequently, we consider in this 
section the general sum 


2P 
T(a) = 2) e(az"), 

z=P 
where’ k = 4; and we allow the constants implied by the symbol 0 to depend 
on k as well as on 6, e«. The advantage of having the results set out in a form 
suitable for general reference may, perhaps, compensate for this intrusion of 
unnecessary generality. 

We define 


q 
Se = > e,(ax"), 
(2P)* 1 


I(g) = zn **"*e(Bn), 


n=—Pk k 


T*(a, a, 9) = Sool (« a *), 





’ The corresponding results for k = 3 (some of which are more precise) are given in the 
author’s paper ‘‘On Waring’s problem for cubes,’’ Acta Math. 
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Lemma 3. Sae = O(g'™*). 
Proor. Landau, Satz 315. 
Lemma 4. If |8| S 4, then 


I(8) = O (min(P, P*™| 6 |"). 


Proor. The inequality I(6) = O(P) is trivial. Also, if |8| < 4, we have 


DX e(6n) = O(\6{") 


n=n} 


forany m1, 2. Hence, by Abel’s Lemma, 


(2P)* 


> n*t* (Bn) = o(P | 8 eel 


n=Pk 
Lemma 5. Ifa= as B, where | B| S 3, then 


T*(a, a, q) = Og min (P, P** | 8 |"). 


Proor. Lemmas 3, 4. 
Lemma 6. For any integer v, let 


q 
Saar = p> eq(ac* + va). 
Then, if v # 0, 
Saas = O(q***(q, v)). 


Proor. Davenport-Heilbronn, Lemma 3. 
LemMa 7. Suppose that 


H > 1, q < H** B a O(q'‘H’*”), 
and let v # 0 be an integer. Then 


"ol pt ie +a) ~ 1) 4 
| «(ae 8) a = — 98, (¢( ou : 1) + O@~H”). 

Proor. By integration by parts | times, the integral becomes 
ae »_vH > ( q y"| ( “) Mec aet ii 

ots (e(ant - 2) — 1) - (2) ]e(—*8) vrewen | 

l H 
q vE\ ni k 
+ (;;) | é (- ®) p (e(BE")) dé, 


where D" denotes the h*" derivative with respect to & and [F(£)]o = 
FH) — F(0). It is easily verified that 


D*(e(g¢")) = wees C(r, h, k)p’e*”* e(Be'), 








as ee Seay 
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where C(r, h, k) depends only on the variables specified. For 0 < ~¢ < H 


Sr Sh, we have 


’ 


ser - O(qH" * HH") 
ee O(q (qH ***) rH) 
oe (q*H™) . 
Hence, for 0 S é S H, 
D'(e(Bé")) = O(C(h, k)q"H™). 


Using this in the above expression, the second and third terms are 


(& (4)" C(h, k)g* Ho + (4) HC(, bq” x), 


imi \| »| |y| 


Choose | to be the least integer for which 1 — 146 S —é. Then the last ex- 
pression is 


O(qv HH). 
Lemma 8. If a = = + B, where q < P** and B = O(¢'P"*”), then 


T (a) ae T*(a, a, q) + O(q***). 
Proor. We have 


T(a) = x e((2 a a) ma + ny) 
h=1 (P—h)/qsms(2P—h)/q qd 


= p e,(ah") z e(8(mq + h)'). 


(P—h)/asms (2P—h)/q 


By Poisson’s summation formula, 


sy e(8(mq + h)") = I 


(P—h)/qsms(2P—h)/q 


(2P—h)/q 


e(B(éq + h)") dé 


0 (2P—h)/q . 
+2 / e(B(éq + h)” — vé) dé 
ory (P—h)/q@ 


= 2P ‘ si 00 2P . vé 
mi [ e(BE") dé + q P77 €q(vh) [ (ae o a 
v0 
where >°’ denotes that any term with m = (P — h)/q or m = (2P — h)/q 
is counted with a factor 3. There are at most two values of h for which such a 
term exists, so the replacement of >>’ by }> introduces only an error O(1) in 
T(a). Also the first integral on the right is 


(2P)k 


al ng Alhe(on) dn = 2S ne(6n) +0 (0) 
k Jpe 0 5 nies k * e\pn ’ 


n=P 


sin 


say 


yi 
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H, since, for 7 = 2 + O(1), P*’<ns (2P)*, we have | 
n +" *e(Bn) =e n **¥¥e(Bn) a O(n tH | B | + tH) | | 
eo O(P*“*q'P + ym) i 
a a 
= O(P"). 
Hence ti 
oo 2P vé t 
T(a) = T*(a, a, q) + O(1) + »» €Bace [ (ae - a 
v0 ' 
= T*(a,a,q) + O11) + d, 
say. ; 
P 2P ie 
The conditions of Lemma 7 are satisfied for | and / , hence | 
° \ 
“X= “ 4 
T= at 3 Seer (e( acer)" — 72) — «(ap — ”*)) 
201 y=—oo q q : 
v0 
o( p q- | Sa,¢. | wr") i 
"a0. : 
if 
a _ > + y ae bh 
T ; 
say. 
By Lemma 6, 
du: = O (Pp LP, ») 
pe O (< > d x (md)-*) 
lo m= 
= 0(q'™). 
dé Also 
Y= e(g(2P)) Et warte( — 22") 
iE, |\¥|>¢? q 
— e(BP*) pm y warte(— 7), |» “TI Seri) 
/q |v | >a? y=a—q? 
a 
. = ¢(8(2P)*) >> e(an*) >> ed sin v(h — 2P) Qn 
h=1 yeg?+1 V q 


- 060) Salat) SB ain OP ay + 0( E24) 


yeg?+1 V 
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~ofEins(' iezany)* E(" sir7)) 


Ba +O Fim 











dja msq2/a md 


= O(q'*), (22) 


where || ¢ || denotes the distance of ¢ from the nearest integer. 





This proves Lemma - Tw 
sine 
Lemma 9. If a = — “i B, where q < P'~ and B = O(q'P"**), then 
T - = 0" min (P, P** |B |”). 
Proor. The result follows from Lemmas 5, 8, since By | 
git < q/*P qi < q ip | B wa 
‘ ’ (23) 
Lemma 10. (Weyl’s inequality). Let x = az“ Then (24) 
= €€ —K —K —kex « It 
2, ea(ax*) = O(m' g'(m*™* + mg + m4"). pe 
Proor. Landau, Satz 267 (K = 2‘). (23) 
Lemma 11. If P’* <q < P** and B = O(q'P’ *”), then 
T(a) = O(P**). Sup) 
Proor. Let 
(26) 


S, = 2, e,(ax"). 
By Lemma 10, if m < (2P)*, Thei 
Sn - O(P*(P*™ + Pq“ + P***q")) 
pee o(P tts) we o(P****), 





in virtue of the inequalities satisfied by q. 
By partial summation, 





— Hen 
T(a) = 2) (Sn — Sn-s)e(6n) 
n=Pt all y 
(2p) that 
= 2, Sale(n) — e(8(n + 1))) + Samne(B((2P)* + 1)) — Srse(BP’) 
n=P (27) 
= o("( 181 +1)) a 








oe 
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a O(P' —xt+6 + pi prg tp *) 
=-0 ( re 


The Farey dissection 
Let Q = [P*"]. Forg S P, let Ma, denote the interval 


: et 





Two intervals Ma, ,¢; » Wtas.g, corresponding to different pairs a, q do not overlap, 
since 


a _ a 
N qe 








1 1 2P* _ 1 (? ‘) 
De ie bE EY Bi wa 
~ Hg ng [P] ~ Q\q * qe 
By Lemmas 5, 8, 9, (with k = 4), if a is in M.,, and a = + B, 
(23) T(a), T*(a, a, g) = Og min (P, P* | 6 |7)), 
(24) T(a) — T*(a, a, g) = O(g***). 


It is well known® that for any real a there exist A, q (where A is an integer) 
such that 
A 1 
(25) a~ = $=; q=Q-1, (A,q) = 1. 
q q 
Suppose that @ satisfies 


1 1 
8 1 142, 
(26) g<*< ee 
Then 
i @ 1 
Létqet> £52 ¢@, 
age et we 
Ais de > dice hes 9 +1. 
es Q Q Q 


Hence 1 S$ A < q, and so we are entitled to use the letter a instead of A in (25). 

Ifq < P', the points a satisfying (25) form precisely Mt.,.. We denote by m 
all points of the interval (26) which do not belong to any Dt,~. It follows 
that for any @ in m, there exist a, q such that 


S5 Ph<qsQ-1. 


(27) = 


a-7@'—-—--— 





‘Landau, Satz 158. 
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By (21), taking the interval of integration to be (26), we have 


@8) nw) = FD | 


Mag 


Major arcs 


Lemma 12, 


2, a Ite) —_ T* (a, a, q) | | U(a) da ee o(U?P-*), 


Proor. By (23), (24), if ais in Mtu,, and a = - rs B, 


T*(a) — T**(a, a, g) = O(g***g ** min (P*, P| B |). 


Hence 


| | T°(a) — T**(a, a, q)|\da = O («** 4 min (P*, Pg) is) 


= OP). 
Thus the sum of the lemma is 
“ou > LP) = Orr Zs, & 
asP} a qsP3 
ae O(U? PP), 
Lemma 13. If Dta,q denotes the part of the interval (26) not belonging to Ma, 
then 


x, Xl la, a, a) |U@)Pda = OU. 


asPt a Yaa,q 
Proor. Since the integrand is a periodic function of a with period 1, we 
can take the range of integration to be q'Q <|8| < 4, where a = ; + B. 
By Lemma 5, with k = 4, the sum of the lemma is 


o(u' x fn oe i ad cP" pap) = O(U? ™ > gt? Pg Pe) 


qsPi a 


lh 0( U? pt > a q’) 


qsPi 


sai O(U? Y Sipppeerec ® 
whence the result. 


Minor arcs 


Lemma 14. In m, T(a) = O(P*"), 
Proor. We have seen that for any a in m there exist a, q satisfying (27). 


T°(a)U*(a)e(—Na) da + [ T*(a)U*(a)e(—Na) da. 



















Rese ’ 


_ we 
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Case 1. P? <q = P**. By Lemma 9, with k = 4, 
T(a) = O(¢'*P) = O(P”*). 

Cass 2. P-*> <q SQ-—1< PP. Lemma ll with k = 4. 

tewaca 15. [| T(@) [*| Ula) Pda = O(U*P H+, 

Proor. By Laces 14 we have, in m, 
(29) T*(a) = o(P* +). 

By (20), 

[\ 7@)U(@) F aa, 

taken over any interval of length 1, is precisely the number of solutions of (2) 


subject to (3). By (17), (18), (19), the conditions of Lemma 1 are satisfied. 
Hence 


(30) / | T(a)U(a) da = O(P?U?P™***), 


The assertion of the lemma follows from (29), (30). 
The singular series 
Lemma 16. / T**(a, a, g)e(—na)da = q *(Sa,q)°e.(—na)R(n), where, for 
N-P<n <N, R(n) satisfies 
(31) 2p” < R(n) < 2P*. 
Proor. By (20), the integral in question is 


~ 1 - 
q *(Sa,q)°e¢(—na) pw 46 (m1 7 ee ne) . 
eee 
where the variables of summation are subject to 
(32) P’sm,---, S (2P)', M+ ++ $M = N. 
Since m,, --- , ms determine mg uniquely, we have 


R(n) < pe (2P)"P™ = 2P* 


Also, by (16), 100P* < N < 101P*, so that 99P* < n < 101P*. Hence, if 
18P* < m,--- mS 19P*, the value of ng determined by the last condition 
in (32) satisfies also the other conditions. 

It follows that 


R(n) > - P*(2P)"* = 2 P*. 


s ae ge et re erie ey 
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This establishes Lemma 16. 
Let 


A(n, q) = Dod °(Sa,a)°eq(—na). 


Lema 17. A(n, q) = O(q). 

Proor. Lemma 3. 

Lemma 18. If (q1, 92) = 1 then A(n, 9192) = A(m, 1)A(n, q2). 

Proor. Landau, Satz 282. 

The following notation corresponds to that of Landau, pp. 280-302, when 
k = 4, s = 6, except that in some cases we are precluded from using the same 
symbols. 

For any prime p, we define y = 1 if p > 2 andy = 4 if p = 2. 

For any prime p, and any I, n, let N (p', n) denote the number of solutions of 


ai+...+2i=n(modp'), 1s5a<p7, 


in which not all of 2, --- , 2 are divisible by p. 
Lemma 19. Let 4p + o be the exact power to which p divides n, where 0 S o S 3. 
Let 


lo = max (49 +o + 1, 49 + 7). 
Then 
A(n, p') = Oforl > h, 
and 


C.) p—1 
Xp(n) = > A(n, p’) = p"N(p’, 0) > a + pron (s’ *). 

where, if p = 0, the sum on the right isto be read as zero. 

Proor. This is the case k = 4, s = 6 of Landau’s Satz 293. (Note that 
Landau uses P for p” and £ instead of p.) 

Corotuary. If p 4 2n, then A(n, p') = Oforl > 1. 

Lemma 20. Jf p > 2, then N(p’, n) > 0 for all n. 

Proor. By Landau, Sitze 300 and 301, with s = 6, it suffices to prove that 


¥ 





6275 ,p-) +1. 


For p > 2, the right-hand side is (4, p — 1) + 1 S 5. 
Lemma 21. If p = 2, and n = 1, 2, 3, 4, 5, or 6 (mod 16), then N(p’, n) > 0. 
Proor. Since p’ = 16, the result is obvious. _ 
Lemma 22. For any prime p, and any n = 1, 2, 3, 4, 5, or 6 (mod 16), 


Xp(n) = a 


C0 


80 


4, 








of 
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Proor. Case 1. Suppose p* { n, so that p = 0. By Lemmas 19, 20, 21, 


p °"N(p", n) 


Xp(n) 


IV IV 
3, s 
s 


Case 2. Suppose p‘|n, so that p > 1 and p > 2. By Lemmas 19, 20, 
xo(n) = p "N(p’, 0) = p*. 


Lemma 23. For any prime p and any n, | A(n, p)| < Cip”. 
Proor. Landau, Satz 318, with s = 6. 

Lemma 24. For any prime p and any n, x,(n) > 1 — Cop”. 
Proor. Landau, Satz 322, with s = 6. 

Lemma 25. If n = 1, 2, 3, 4, 5, or 6 (mod 16), the series 


S(n) = > A(n, q) 


converges absolutely, and 


S(n) > C3 ° 
Proor. By the Corollary to Lemma 19, and by Lemma 23, if p 4 2n, 
| xo(n) — 1| = | A(n, p)| < Cp”. 


Hence the product [[ x»(n) is absolutely convergent, therefore by Lemma 18 
Pp 


so is the series }) A(n, q), and the two are equal. 
q=1 


By Lemmas 22, 24, 
II x(n) > © vp” II G- Gop”) 
Pp Pp 


SCeo p>Ce 
= C3. 
Lemma 26. For» = 1, >>| A(n,q)| = O(n'n). 
q21 


Proor. Any positive integer q is representable as 2’qig2 , where 
(1) q is quadratfrei and odd; 

(2) ge is odd and composed of prime powers with exponents = 2; 
(3) (%1 5 Q2) = |, 

By Lemma 17, 


A(n, 2) = 0(2"), — A(n, q2) = O(G3'). 


Also, by Lemma 19, if p'|q2, p'’4q2, and A(n, p') # 0, then < hb = 
49 + ¢ + 1, where 4p + o is the exact power to which p divides n. Since 
12 2, it follows that p' | p*“? | n°. Hence, if A(n, g2) ~ 0, we must have qe | n°. 
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By Lemmas 18, 23, 
|A(n, m)| = I] | A(n, p)| < I] (Cip”) = O(q;°**). 


P\|q1 P\q } 


Hence 
X | At, @)| = °(F | ee srsft 
q@=1 T 
27q192>0 
sa ofr ym > .s g, ttt at) 
qi=1 g2|n? r=0 
= O(n *n‘). 
Proof of the Theorem 
By (28), 


wie 2s | T*(a, a, q)U(a)e(—Na) da 


qsP* a 


-~ ZY, Cf Pea, dU%@e(—Na) da 
+ DL ae — Pa, a, d)U%ade(—Na) de 


+ [ T*(a)U%(a)e(—Na) de. 


By Lemma 13, the second sum is O(U’). By Lemma 12, the third sum is 
O(U*P***). By Lemma 15, the final integral is O(U?P* +***), By (17), the 
last of these three error terms is the largest. Also the first sum, by Lemma 16, is 


> pe » > [ T*(a, a, ge(una + uja — Na) da 


h=1 j=1 qsP 


U 


=> > Do, LL 1 (See)°eq(—(N — un — ua)R(N — wu — u) 


h=1 j=l qsP? a 
U 


mM z A(N — uw, — u;, @R(N — ws — uy). 


h=1 j=1 qsPt 


By Lemma 26, 


en a. uj, q) = S(N — um — u) + O(P *). 


qsP 


Also, since u,, u; < P** = o(P*),n = N — uw — u; satisfies the condition of 
Lemma 16. Thus in the above sum, R(N — u, — u,;) satisfies 


(33) 2°P? < RIN — wm —u,) < 2P*. 
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Hence 
U U 
ry(N) = p> > S(N — uw — u)R(N — w — uj) + O(U? PP’) 


+ O(U* Pets) 


By (15), N — ua — u; = 1, 2, 3, 4, 5, or 6 (mod 16). Hence, by Lemma 25, 
(N — ua — us) > Cs. Thus the above sum is greater than 


U°C;2-P’. 
By (17), 
1.72 1 
Hence 
ru(N) > 0. 
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L’ASPECT QUALITATIF DE LA THEORIE ANALYTIQUE DES 
POLYNOMES 


PAR J. DIEUDONNE 


(Received March 21, 1939) 


1. La plupart des problémes qui se posent dans ce que j’ai proposé récemment' 
d’appeler la théorie analytique des polynomes, rentrent dans le schéma général 
suivant: on considére une famille de polynomes d’une variable complexe, 
de degré borné, et qui dépendent d’un certain nombre de paramétres variables; 
il s’agit de savoir s’il existe, et dans l’affirmative de déterminer, des régions du 
plan complexe (non identiques au plan tout entier) telles que tout polynome de la 
famille posséde, dans une telle région, un nombre de zéros au moins égal a un 
nombre donné r > 0. 

La formulation méme d’un tel probléme montre qu’il se décompose naturelle- 
ment en deux parties: la premiére, qu’on peut appeler la partie qualitative du 
probléme, consiste 4 déterminer les nombres r pour lesquels on peut affirmer 
existence de régions ayant la propriété désirée; cela fait, on doit passer 4 la 
partie quantitative du probléme, c’est-a-dire déterminer ces régions pour chacune 
des valeurs de r ov leur existence est assurée. 

Je me propose, dans cet article, de montrer comment, dans tous les cas envi- 
sagés jusqu’ici, la partie qualitative du probléme précédent peut, par |’emploi 
d’une méthode uniforme, se ramener 4 un autre probléme dont la difficulté est 
sensiblement moindre dans la plupart des cas. Cette méthode s’appuie sur des 
propriétés de compacité (ou de familles normales, pour employer la terminologie 
de M. P. Montel); elle n’est d’ailleurs pas essentiellement nouvelle, et on la 
retrouve ga et la dans la plupart des travaux sur la théorie analytique des 
polynomes.” Mais on ne semble pas avoir remarqué avec quelle facilité elle 
s’applique 4 de nombreux problémes résolus jusqu’a présent par des moyens 
trés particuliers et trés divers, et souvent moyennant des hypothéses superflues. 

Ce fait accentue encore le contraste entre l’aspect qualitatif et l’aspect quanti- 
tatif de la Théorie analytique des polynomes, car les résultats quantitatifs 
obtenus jusqu’ici l’ont tous été au moyen d’artifices variés ne se rattachant 4 
aucune idée générale. 





1 J. Dieudonné, La théorie analytique des polynomes d’une variable (Mémorial des Sciences 
mathématiques, fasc. 93). Dans ce qui suit, nous nous référons 4 cet opuscule en le dési- 
gnant simplement par ‘“‘Mémorial.”’ 

? Voir en particulier P. Montel, Sur quelques limites pour les modules des zéros des poly- 
nomes (Comment. Math. Helv., t. 7, 1934-35, p. 178-200), et J. Dieudonné, Sur la variation 
des zéros des dérivées des fractions rationnelles (Ann. de l’Ecole Normale Supérieure, 3¢ 
série, t. 54, 1937, p. 101-150); dans ce dernier travail, la méthode générale exposée ici est 
appliquée a l’étude qualitative compléte d’un probléme particulier. 
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9. Précisons d’abord le probléme énoncé ci-dessus. Soit N le degré maximum 
des polynomes de la famille F; nous ferons la convention suivante: tout polynome 
de la famille F dont le degré n sera inférieur 4 N sera considéré comme ayant un 
,éro multiple d’ordre N — nau point a l’infini. Nous supposons en outre que la 


famille F ne contient pas le polynome identiquement nul: tout polynome de ‘F 


posséde alors exactement N zéros (chacun compté avec son ordre de multiplicité) 
dans le plan fermé par l’adjonction du point a ]’infini. 

Comme il s’agit d’étudier la répartition des zéros, on peut supposer que, si la 
famille  contient un polynome P(z), elle contient aussi le polynome aP(z), 
quelle que soit la constante — a ~ 0. 

Si P(r) = ao + aye + age” + --- + aya”, on peut faire correspondre A ces 
polynomes le point de l’espace projectif shingliens ad N dimensions P™ dont les 
coordonnées homogénes sont (a, a, @2,---,@y). A la famille F correspond 
ainsi un sous-ensemble F de P”, de sorte que tous les polynomes de * corres- 
pondant 4 un méme point de F ont les mémes zéros. 

Dans l’énoncé du probléme général, nous avons parlé de régions du plan con- 
nant toujours r zéros au moins de tout polynome de la famille ‘; précisons 
maintenant que nous entendons par la des sous-ensembles fermés du plan complexe 
fermé par adjonction du point 4 l’infini. Dans ces conditions, on peut aussi, en 
vertu de la continuité des racines d’une équation en fonction des paramétres, 
supposer fermée la famille , autrement dit supposer que F est un ensemble 
fermé dans P”; d’ou, en vertu de la compacité de l’espace projectif complexe, 
résulte que F est compact. 


3. Ces préliminaires étant posés, nous cherchons donc les valeurs de |’entier r 
telles qu’il existe un ensemble fermé E, , non identique au plan tout entier, et 
contenant r zéros au moins de tout polynome de la famille ‘Ff; il est clair que, si r 
posséde cette propriété, tout entier r’ < r la posséde également; il suffit done de 
chercher le maximum p des nombres r pour lesquels la propriété précédente est 
vraie. Nous désignerons cette recherche sous le nom de probléme global. 

Pour le résoudre, étudions d’abord le probléme qui se pose dans les mémes 
termes, avec la seule différence qu’on impose en outre aux ensembles £, la 
condition de ne pas contenir un point donné x du plan. Désignons par o(20) le 
maximum des nombres r pour lesquels il existe des ensembles E, satisfaisant & 
cette condition supplémentaire; nous appellerons probléme local (relatif au point 
%) la recherche de ce nombre. II est clair qu’on a ensuite p = Max o(zo) lorsque 
% parcourt le plan fermé; la résolution du probléme local pour tout point du plan 
entraine donc celle du probléme global. 

Or, le nombre o(2o) peut encore étre défini par les deux conditions suivantes: 

1° quel que soit le voisinage V de 2», il existe un polynome de la famille F 
possédant au moins N — o(xo) zéros dans V; 

2’ il existe un voisinage V» de x tel que tout polynome de la famille ‘F posséde 
au plus N — o(ao) zéros dans Vo . 

Désignons alors par X(2o) U’ordre de multiplicité maximum du point x comme 
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zéro des polynomes de la famille F (A(x) = 0 si aucun polynome de ne s’annule 
en 2%). Nous allons voir qu’on a 


(1) a(t) = N — X(a). 


En effet, comme il existe au moins un polynome de Ff ayant au point 2» un zéro 
multiple d’ordre \(%), donc ayant au moins \(%) zéros dans le voisinage V), ona 


(2) A(t) S N — o(2) 


D’autre part, soit (V;) (¢ = 1, 2, ---) une suite de voisinages de 2» dont I’inter- 
section se réduise au point 2 ; 4 chaque indice 7 correspond un polynome P;(z) 
de la famille  possédant au moins N — o(x) zéros dans V;. Comme F est 
compact, on peut extraire de la suite (P;(x)) une suite partielle convergeant vers 
un polynome P)(zx) de Ff, et, d’aprés la continuité des racines, Po(x) posséde un 
zéro multiple d’ordre N — o(x) au moins au point 2 , d’ot 


(3) N — o(%o) S Ao) 


La comparaison de (2) et (3) donne la relation (1). On en déduit que, si u est 
le minimum des nombres \(x) lorsque 2 parcourt le plan fermé, on a 


(4) p=N-—-u. 


4. Pour appliquer les résultats précédents, il est essentiel que F soit fermé; 
s’il n’en est pas ainsi, il faut commencer par remplacer F par son adhérence F, 
c’est-a-dire adjoindre a F les points d’accumulation de toutes les suites de points 
de F, qui n’appartiennent pas 4 F. Dans le calcul de (ao), il faudra faire entrer 
en ligne de compte, non seulement les polynomes de la famille donnée, mais 
aussi ceux qui correspondent aux points de F — F. 

Par exemple, la famille ‘F est donnée le plus souvent par |’expression des 
coefficients a , a, --- , dy en fonction de parameétres indépendants dont chacun 
est assujetti 4 décrire un certain sous-ensemble du plan complexe; si les sous- 
ensembles ot varient certains de ces paramétres ne sont pas fermés (dans le plan 
fermé par adjonction du point 4 l’infini), il faudra adjoindre a l’ensemble F les 
points d’accumulation des suites qu’on obtient en faisant tendre chacun des 
paramétres considérés vers un point d’accumulation quelconque de |’ensemble 
ou il varie. C’est ainsi que, souvent, certains paramétres peuvent prendre 
toutes les valeurs complexes finies; il faut alors adjoindre a l’ensemble F les 
points d’accumulation des suites qu’on obtient en faisant tendre ces paramétres 
vers le point 4 l’infini. Par exemple, 4 la famille des polynomes de la forme 
1+ 2+ az’ + bz’ od a et b prennent toutes les valeurs complexes finies, il faut 
adjoindre tous les polynomes de la forme az” + ba’, od a et b sont encore des 
nombres finis arbitraires. 

Considérons de méme la famille F formée des numérateurs des dérivées 
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donnés) dont chacun des péles et des zéros décrit un sous-ensemble fermé donné; 
il peut exister certaines positions de ces points pour lesquelles la fraction se 
réduit 4 un polynome de degré inférieur 4 m, et ces positions doivent par suite 
étre exclues, puisque la dérivée m* est alors identiquement nulle; mais il faut 
adjoindre a l’ensemble F les points d’accumulation des suites qu’on obtient en 
faisant tendre les zéros et les pdéles des fractions rationnelles considérées vers les 
positions singuliéres précédentes; et les polynomes correspondant & ces points 
d’accumulation n’appartiennent pas nécessairement & la famille  initiale.* 


5. Donnons maintenant quelques exemples d’application de la méthode que 
nous venons de décrire. 
Considérons en premier lieu la famille des polynomes 


(5) 1+ ax + aor? + .-- + 0,2” 


de degré n donné, dont certains des coefficients a; , --- , a, sont fixes, les autres 
pouvant prendre toutes les valeurs complexes finies; la détermination du nombre 
o(«), et des ensembles bornés #, contenant au moins r zéros de chacun de ces 
polynomes (pour chaque r < o()) constitue ce qu’on appelle le probléme de 
Landau-Montel. D’aprés ce qu’on a vu plus haut, il faut tout d’abord adjoindre 
aux polynomes (5) ceux qu’on obtient en remplagant par 0 les coefficients de 
(5) qui sont fixés; on obtient ainsi une famille fermée. Soit alors a, le premier 
coefficient de (5) qui soit variable, a, le dernier coefficient donné et différent de 0; 
o() est le degré minimum des polynomes de la famille considérée (fermée comme 
il vient d’étre dit); or, il est immédiat que ce degré est le plus petit des nombres 
P, 9, ce qui résout qualitativement le probléme de Landau-Montel.* 

De fagon générale, pour une famille fermée de polynomes, le nombre o(@ ) 
sera le degré minimum des polynomes de *. 


6. Considérons maintenant la famille des polynomes de la forme 
(6) aP;(x) + a2Po(x) + --- + axPx(z) 


ou a, --- , a, peuvent prendre toutes les valeurs complexes finies, et od P,(z) 
est un polynome de degré n (n = k), dont tous les zéros sont assujettis 4 varier 
dans un domaine circulaire fermé C; (i = 1, 2,---, k). En outre, nous sup- 
poserons d’abord que les domaines circulaires C; , C2, --- , Cy n’ont aucun point 
commun deux a@ deux. 

Cette famille est fermée; nous allons montrer qu’onap = n—k+ 1. En 
effet, il existe des points n’appartenant A aucun des C; ; soit 2 l’un d’eux; il 
nous suffira de montrer que \(zo) = k — 1 (car en un point yo appartenant 4 un 





_*On montre cependant que ce sont encore des numérateurs de dérivées m*™ de frac- 
tions rationnelles de degré n (voir le mémoire de l’auteur cité dans la note 2, p. 110, lemme 1). 


‘ Ce raisonnement ne différe que par la forme de celui de M. Montel dans le mémoire cité 
plus haut. 
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zéro des polynomes de la famille F (A(z) = 0 si aucun polynome de ne s’annule 
en 2%). Nous allons voir qu’on a 


(1) o(o) = N — Xa). 


En effet, comme il existe au moins un polynome de ayant au point x un zéro 
multiple d’ordre d(x), done ayant au moins A(z») zéros dans le voisinage V» , ona 


(2) (mo) S N — a(t) 


D’autre part, soit (V;) (¢ = 1, 2, ---) une suite de voisinages de 2 dont |’inter- 
section se réduise au point 2 ; 4 chaque indice 7 correspond un polynome P;(z) 
de la famille F possédant au moins N — o(x) zéros dans V;. Comme F est 
compact, on peut extraire de la suite (P;(x)) une suite partielle convergeant vers 
un polynome P (x) de *&f, et, d’aprés la continuité des racines, Po(x) posséde un 
zéro multiple d’ordre N — o(a) au moins au point 2% , d’ot 


(3) N — a(x) S (xo) 


La comparaison de (2) et (3) donne la relation (1). On en déduit que, si u est 
le minimum des nombres A(x) lorsque 2 parcourt le plan fermé, on a 


(4) p=N-—u. 


4. Pour appliquer les résultats précédents, il est essentiel que F soit fermé; 
s’il n’en est pas ainsi, il faut commencer par remplacer F par son adhérence F, 
c’est-a-dire adjoindre a F les points d’accumulation de toutes les suites de points 
de F, qui n’appartiennent pasa F. Dans le calcul de \(2o), il faudra faire entrer 
en ligne de compte, non seulement les polynomes de la famille donnée, mais 
aussi ceux qui correspondent aux points de F — F. 

Par exemple, la famille est donnée le plus souvent par |’expression des 
coefficients da , a; , --- , dy en fonction de parameétres indépendants dont chacun 
est assujetti 4 décrire un certain sous-ensemble du plan complexe; si les sous- 
ensembles ot varient certains de ces paramétres ne sont pas fermés (dans le plan 
fermé par adjonction du point 4 |’infini), il faudra adjoindre a l’ensemble F les 
points d’accumulation des suites qu’on obtient en faisant tendre chacun des 
paramétres considérés vers un point d’accumulation quelconque de |’ensemble 
ou il varie. C’est ainsi que, souvent, certains paramétres peuvent prendre 
toutes les valeurs complexes finies; il faut alors adjoindre 4 |’ensemble F les 
points d’accumulation des suites qu’on obtient en faisant tendre ces paramétres 
vers le point 4 l’infini. Par exemple, A la famille des polynomes de la forme 
1 + 2+ az’ + bz’ od a et b prennent toutes les valeurs complexes finies, il faut 
adjoindre tous les polynomes de la forme az” + ba’, ot a et b sont encore des 
nombres finis arbitraires. 

Considérons de méme la famille F formée des numérateurs des dérivées 
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donnés) dont chacun des péles et des zéros décrit un sous-ensemble fermé donné; 
il peut exister certaines positions de ces points pour lesquelles la fraction se 
réduit A un polynome de degré inférieur 4 m, et ces positions doivent par suite 
étre exclues, puisque la dérivée m* est alors identiquement nulle; mais il faut 
adjoindre 4 l’ensemble F les points d’accumulation des suites qu’on obtient en 
faisant tendre les zéros et les péles des fractions rationnelles considérées vers les 
positions singuliéres précédentes; et les polynomes correspondant 4 ces points 
d’accumulation n’appartiennent pas nécessairement & la famille  initiale.* 


5. Donnons maintenant quelques exemples d’application de la méthode que 
nous venons de décrire. 
Considérons en premier lieu la famille des polynomes 


(5) 1 + ay + age? + --- + 4,2" 


de degré n donné, dont certains des coefficients a; , --- , a, sont fixes, les autres 
pouvant prendre toutes les valeurs complexes finies; la détermination du nombre 
o(«), et des ensembles bornés #, contenant au moins r zéros de chacun de ces 
polynomes (pour chaque r S o(@)) constitue ce qu’on appelle le probléme de 
Landau-Montel. D’aprés ce qu’on a vu plus haut, il faut tout d’abord adjoindre 
aux polynomes (5) ceux qu’on obtient en remplagant par 0 les coefficients de 
(5) qui sont fixés; on obtient ainsi une famille fermée. Soit alors a, le premier 
coefficient de (5) qui soit variable, a, le dernier coefficient donné et différent de 0; 
o(%) est le degré minimum des polynomes de la famille considérée (fermée comme 
il vient d’étre dit); or, il est immédiat que ce degré est le plus petit des nombres 
P, 9, ce qui résout qualitativement le probléme de Landau-Montel.* 

De fagon générale, pour une famille fermée de polynomes, le nombre o( ) 
sera le degré minimum des polynomes de *. 


6. Considérons maintenant la famille des polynomes de la forme 
(6) a,P;(x) + aeP2(x) + --- + oP: (x) 


ou a, --- , a, peuvent prendre toutes les valeurs complexes finies, et od P;(z) 
est un polynome de degré n (n = k), dont tous les zéros sont assujettis 4 varier 
dans un domaine circulaire fermé C; (¢ = 1, 2,---, k). En outre, nous sup- 
poserons d’abord que les domaines circulaires C; , C2, --- , C, n’ont aucun point 
commun deux a& deux. 

Cette famille est fermée; nous allons montrer qu’onap =n—k+1. En 
effet, il existe des points n’appartenant A aucun des C; ; soit 2 l’un d’eux; il 
hous suffira de montrer que A(z») = k — 1 (car en un point yo appartenant 4 un 





_* On montre cependant que ce sont encore des numérateurs de dérivées mms de frac- 
tions rationnelles de degré n (voir le mémoire de |’ auteur cité dans la note 2, p. 110, lemme 1). 

‘ Ce raisonnement ne différe que par la forme de celui de M. Montel dans le mémoire cité 
plus haut. 
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des C; , on a évidemment A(yo) = n). Cela revient 4 voir que le déterminant 


Pi(%o) = Pa(xo) -- + Px(ao) 
P3(20) P2(a0) «++ Pi(ao) 


| ofted ES Pt” (a) 


n’est pas nul. Or ce déterminant est linéaire par rapport 4 chacune des racines 
des P,(x), considérées comme variables; d’aprés le théoréme de Grace,’ si ce 
déterminant était nul, il existerait, dans chacun des domaines C;, un point z; 
tel que 


(x0 — 21)" (to — 22)" +++ (Ho — ae)” | 
(xo ~e #)"" (x0 —_ %)*" ee (xo > zz)” ae 
(xo aoe ay” (ao ne ars bi a (ao “a art 
Or ce déterminant est égal au produit des facteurs (xo — 2;)" “"’ (i = 1,2, --- , k) 


par le déterminant de Vandermonde des quantités (x — 2;), et il résulte des 
hypothéses faites qu’aucun de ces facteurs ne peut étre nul, d’ow la proposition. 

L’hypothése faite sur les C; est essentielle; montrons en effet que si deux do- 
maines circulaires C; , C; ont un point intérieur commun % ,onap = 0. Prenons 
en effet P;(x) = (x — 2)", et soit a un nombre assez petit pour que le polynome 
(x — 2)" + a(x — 2)” ait toutes ses racines dans C;, ce qui est possible d’aprés 
V’hypothése; en prenant P(x) égal 4 ce polynome, a; = 1, a; = —1, et a, = 0 
pour les indices h différents de 7 et 7, le polynome (6) a un zéro multiple d’ordre 
n au point 2 , d’ou la proposition. 


7. Soit F, la famille des polynomes de la forme 


m+1 m+1 m+1 a 
(x — a1)"" (x — 22) ++ (@ — n) eee 
(7) 
i are jigs Se 
+ @aayen tT @ a 

ot (a; , @2, +++ ,@n) décrit, dans l’espace projectif complexe P” un ensemble 
fermé G de sorte qu’il existe un nombre k > 0 tel que, pour tout choix d’indices 
4 ,%2, +++ , t parmi les nombres 1, 2, --- , n, on ait 
(8) la, +a, +--+ +a;,| = k-Max | ai, | 


lsrsp 


C’est le cas par exemple lorsque tous les a; ne prennent que des valeurs positives 
ou nulles. 
En outre, 2941, X42, -++ ,2n peuvent prendre toutes les valeurs complexes 





5 Voir Mémorial, p. 12, théoréme VIIc. 
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finies; enfin, 41, 22, +++, 2, varient arbitrairement dans un ensemble borné 
fermé E. 

On peut évidemment supposer que Maxi<ic, | a; | = 1; d’aprés (8) le coefficient 
de la plus haute puissance de x dans le polynome (7) a son module compris entre 
k et n; lorsque les 2; varient en restant bornés, et que le point (a; , a2, --- , n) 
varie arbitrairement dans G, on n’obtient, comme limites de suites de polynomes 
de ‘F, , que des polynomes de cette famille. 

Pour fermer F,, il suffit done de considérer le cas ot: un certain nombre de points 
2; (q + 1 St S n) tendent vers le point 4 !’infini, les autres restant bornés et le 
point (a;, @2,---,@,) variant arbitrairement dans G; si, parmi les indices i 
pour lesquels x; reste borné, il en existe un tel que a; ne tende pas vers 0, on voit 
immédiatement qu’on obtient, comme limites, des polynomes des familles ‘f, 
définies comme ‘f, , mais pour un nombre r de termes compris entre q et n. Si 
on n’est pas dans ce cas, les limites sont, soit des polynomes des familles ff, , 
soit des polynomes qu’on obtient en ajoutant une constante arbitraire dans le 
crochet des formules analogues 4 (7) définissant les polynomes des familles 
5, pourr <n, 

Cela étant, la condition (8) montre immédiatement que, dans les polynomes de 
la famille fermée ainsi obtenue, le terme de plus haut degré est au moins de 
degré (m + 1) (¢g — 1), cette borne inférieure étant effectivement atteinte 


lorsqu’on laisse fixes les a; et qu’on fait tendre 2,41, --- , 2%, vers le point a 
l'infini. On a done 
(9) a(0) = (m + 1)q — 1) 


ce qui généralise un théoréme de M. Fekete.° 


8. Un raisonnement analogue montrerait que si on considére encore les 
polynomes de la forme (7), avec les mémes hypothéses sur les x; , mais ot les a; 
peuvent prendre toutes les valeurs complexes finies, on a cette fois’ 


(10) a(o) = mq — 1) 


d’ol, par transformation homographique, on déduit aisément que l’on a aussi 
p = m(q — 1) pour cette famille de polynomes. 

En particulier, siz; , 22, --- , 2, sont fixes et distincts, les a; variant arbitraire- 
ment, on ap = m(n — 1). Par un raisonnement de continuité facile, on étend 
ce résultat aux polynomes de la forme 


(11) Pi(z)Pa(x) «++ Pa(z) Ee -f Pala) Ti * Pas 


ot P,(x) est un polynome de degré m + 1, dont tous les zéros varient dans un 
Voisinage suffisamment petit d’un point 2z;, les points 2; étant tous distincts 


* Mémorial, p. 60, théoréme XLVIII. 
"J. Dieudonné, loc.cit. (note 2), p. 146. 
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(¢ = 1, 2,---+,m) et les a; arbitraires. Mais on n’a pas ici de résultat aussi 
simple que pour les polynomes de la forme (6), en ce qui concerne la détermina- 
tion des domaines ot peuvent varier les zéros des P;(x) pour que le résultat 
précédent demeure valable. On peut seulement montrer que p = 0 si on 
n’impose aucune condition & ces domaines: il suffit de prendre P;(x) = x”*' — b, : 
le b; étant distincts, et de considérer la décomposition en éléments simples de la 
fraction 


1 
(y — bi)(y — be) --+ (y — dn) 


en y remplagant y par x””’, pour voir que le polynome (11) peut se réduire A une 
constante non nulle. 

Signalons encore deux résultats analogues 4 celui du n°7. Si, dans les poly- 
nomes (11), les zéros de chacun des P;(x) varient dans un ensemble fermé borné, 
et si de plus les a; satisfont 4 la condition 

[ai + a2 +--+: +a,| = k-Max |a;| 


lsisn 





on aa(«) = (m+ 1) (nm — 1) pour ces polynomes, ce qui généralise qualitative- 
ment un résultat de M. M. J. v. sz. Nagy et Marden.* 

Si maintenant on considére les polynomes de la forme (11), ot Pi(x) = 
(2 — a,)”*, et ot on fait sur les a; et les x; les mémes hypotheses qu’au n°7, le 
méme raisonnement que dans ce n° montre que 


a(o) = pr + Pot +++ + Pq — Max (pi, po, «++ , Pa) 
formule qui redonne bien (9) quand les p; sont tous égaux 4 m + 1. 


Nancy, FRANCE 





8 Mémorial, p. 55, théoréme XLII. 
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ANNALS OF MATHEMATICS 
Vol. 40, No. 4, October, 1939 


STRUCTURE AND AUTOMORPHISMS OF SEMI-SIMPLE LIE GROUPS 
IN THE LARGE’ 


By N. Jacosson 


(Receivéd May 3, 1939) 


The present paper attempts to fill in several gaps in the literature relating Lie 
algebras to Lie groups. It is well known that there is a (1-1) correspondence 
between Lie subalgebras of the Lie algebra & of G and its closed local subgroups. 
However in general different closed local subgroups may generate the same closed 
subgroup of G. We can show nevertheless that © is semi-simple (simple) if 
and only if £ is semi-simple (simple). We also give a new set of linear groups 
which represent the classes of locally isomorphic simple Lie groups and which is 
somewhat simpler than Cartan’s original list.? Omitting a finite number of 
exceptions these are merely the important geometric linear groups with real 
complex and quaternionic elements (unimodular, orthogonal and symplectic® 
groups). We determine the group % of bicontinuous automorphisms of these 
“unexceptional” groups and discuss the structure of %/%, ¥ the set of inner 
automorphisms. 

1. We begin with a brief resumé of the local theory. A topological space U 
is a local group* (group germ, group nucleus) if it contains an open set V in 
which a composition zy in U is defined such that 

1. If x, y, 2, zy, yz are in V then x(yz) = (zy)z. 

2. There is a point 1 in V such that lz = 21 = 2 for any zin V. 

3. For each xin V there is an xin V such that zx = x ‘2 = 1. 

4. zy and « ’ are continuous in x and y. 

U is a local Lie group if it is an r-cell and in place of 4. we have 

4’. zy and x’ are analytic in x and y in the sense that the codrdinates are 
analytic functions of those of z and y. 

W is a local subgroup of U if it is a local group relative to the same composition 
and topology as defined in U. Two local subgroups are regarded as identical if 
their intersection is open in each of the local groups. W is invariant if for each 
neighborhood W, of 1 in W there exist symmetric’ neighborhoods N and Z of the 
identity in U and W respectively such that x ‘zr e Wi if re NandzeZ. Two 





‘ Presented to the National Academy of Sciences Oct. 24, 1938. 

? Cartan [2]. 
* This term has been introduced by Prof. Weyl (The classical groups, Princeton 1939) 
in place of the earlier and undesirable terms Abelian or complez linear groups. 

‘Sometimes called group germ or group nucleus. The present term has been adopted in 
the forthcoming translation of Pontrjagin’s book on continuous groups. 

*W1is symmetric if W7? the set of points wy1, win W:, = W:. 
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756 N. JACOBSON 


local groups U and U’ are isomorphic if there exists a homeomorphism x — ,’ 
between suitable symmetric neighborhoods N and N’ of 1 and 1’ such that if 
a, y, zy are in N then z’y’ = (zy)’isin N’. It follows that 1 > 1’, 2" = (z’)7. 
We may also suppose that if x’, y’, z’y’ are in N’ then zy is in N.° 

Suppose now that U is a local Lie group and let M be a cell neighborhood of | 
in which canonical parameters’ are defined. Then ifz = (&,---,&)eNa 
concentric sphere of noes the radius of M then 2” has codrdinates 2¢; and has a 
unique square root z'in N. We remark that x may not have a unique n" root 
(n > 2) in N but it has a unique proper n“ root y in the sense that y, y’, - - . y" 
are allin N. For our purpose the existence and uniqueness of square roots 

suffices. We define 2? = (z')*,... , 27?" = 2???" o!™ if 0 < p < 2” and 

oP" = (?")-! We define x* for t's Sasl by a limiting process. Finally 
if|a|> 1 but (a&, --- , vé,) is still in N we define x” as the element y in N 
such that y'"/* = 2. The canonical coérdinates of x* are aé;. Since the usual 
rules for powers hold the elements x“, z fixed form a 1-dimensional local subgroup 
and these local subgroups cover the neighborhood N without overlapping. 

It has been shown by G. Birkhoff* that the Lie algebra associated with U may 
be introduced in the following way: If z and y are in a suitable neighborhood N’ 
of 1 then lima-o (x*y*)"* = 2 + yand lima-o (2*y*r *y*)" = [z, yl exist. The 
coérdinates of x + y are &; + nif x = (&,--- , &), i =(m,---,,). [2, y]is 
bilinear in x and y and satisfies 


[x,y] = —ly, x] [a, ly, zl] + ly, lz, z]] + le, [z, yl] = 0 


(—a denotes the elements with coérdinates — £;,2 + (—z) =0=1). Wemay 
regard N’ as the interior of a sphere in r-dimensional Cartesian space % and xz + y 
as vector addition in this space. If x and y are arbitrary in % we may choose 
p ~ 0 so that pz, py « N’ and define [z, y] = p ‘[pzx, py]. This is independent of p 
and together with the operation + defines & as a Lie algebra over the field of real 
numbers. We call & the Lie algebra of U. The operations in & have been defined 
by algebraic and topological processes in U. Conversely we may express the 
group operation by 


ryy=xtyti3[z,ylt+- 


where all the terms are pure commutators. It follows that any isomorphism 
between local Lie groups induces an isomorphism between their Lie algebras and 
conversely. 

It has been shown by E. Cartan that if W is a closed local subgroup of U there 
is a neighborhood of 1 in W whose elements all belong to a subalgebra It of g.’ 





® If we choose N, so that N? C N and the corresponding N{ then this will hold for N, in 
place of N. 

’ Cf. Eisenhart [1], p. 44 or Birkhoff [1], p. 73. 

§ Birkhoff [1]. Cf. also Pontrjagin’s book. For the definitions needed from the theory 
of Lie algebras see Jacobson [1], p. 875. 

* Cartan [3], p. 22. 
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gt may be defined as the set of all the limiting directions of elements of W con- 
verging tol. Thus W isa local Lie group with QM as its Lie algebra. M = 0 
if and only if W is diserete. It follows also that any totally disconnected closed 
local subgroup of U is discrete. W is invariant if and only if M is an ideal. 


2. If U is a neighborhood of 1 in a topological group © it is a local group rela- 
tive to the multiplication and topology defined in G. We recall that @, and G, 
are locally isomorphic if they have isomorphic local groups U; and Uz. Gisa 
Lie group if U can be taken as a local Lie group. It follows that the Lie groups 
@, and G, are locally isomorphic if and only if their Lie algebras (the Lie algebras 
of the local Lie groups U; and U2) & and & are isomorphic. If G, is connected 
and simply connected Schreier has shown that if U; is any neighborhood of 1 in 
@, there exists a neighborhood V; C U; such that the elements of G, are repre- 
sentable as vyv2 --- Um, Ve in Vy and vv, --- Vm = 1 if and only if this relation 
can be reduced to 1 = 1 by a sequence of identifications wv = w where u, v, w 
arein U,.° It follows that if G, is connected any local isomorphism between @, 
and ®, is obtained from a unique continuous homomorphism of @, into Gz . 
The group D mapped into 1 in this mapping is discrete. If G: is simply con- 
nected also, D = 1 and the mapping is an isomorphism. In particular if G, = 
®%, = & any local automorphism corresponds to a (bicontinuous) automorphism 
of &. If G is not simply connected there may exist local automorphism which 
do not generate automorphisms in the large. On the other hand any auto- 
morphism determines a local automorphism of © and if Gis connected distinct 
automorphisms will define distinct local automorphisms. Since the latter induce 
linear transformations in 2 it is natural to topologize the group Y% of auto- 
morphism using the usual Euclidean topology in the space of linear transfor- 
mations. 

Any closed subgroup © of @ determines a closed local subgroup W of U and 
hence a subalgebra IN of L. Hence His a Lie group. If § is invariant so is W 
and hence Yt is an ideal. If Mis any subalgebra of & the closed local group W 
determined by Yt may not be a neighborhood of 1 of any closed subgroup of 
." If S* is the smallest closed subgroup containing W, W* its local group and 
* its Lie algebra then W* => W and M* = Mt. If Mt is an ideal W is an 
invariant local group and $* is invariant in ©; the component of 1 in @. Hence 
Y* is an ideal. Similarly if M is commutative so is M’*. Since the elements 
(g°h“g “h-*)"* converge in the, direction through [g, A] it follows from Cartan’s 
theorem that the Lie algebra of the derived group” G’ contains the derived 
algebra 2’. 

We shall call a topological group G topologically semi-simple or briefly t-semi- 
simple if every closed commutative invariant subgroup of © is discrete. If G, is 


Schreier [1], p. 25. 
"A piece of a geodesic in an irrational direction on the torus is an example of this type. 
* The smallest closed subgroup containing all the commutators gi'gz'gige . 
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locally isomorphic to G, $: a closed commutative invariant subgroup of G, 
then §, is discrete. For, otherwise ; determines a local subgroup of this type 
and hence by the local isomorphism a commutative invariant local subgroup not 
discrete in G. If G is connected and D a discrete invariant subgroup, D < ¢ 
the center of G.* Hence if D is a commutative invariant subgroup its closure D 
is discrete and D = D S Calso. The natural homomorphism between @ and 
@/€ is a local isomorphism. Hence G/CC is t-semi-simple. The group ©, = ¢ 
which corresponds to the center of G/€ is discrete and invariant.“ Thus 
€, = G, G/€ has no commutative invariant subgroup ~1 and hence no discrete 
invariant subgroup ~1. 

© is t-simple if every closed invariant subgroup of © is either open or discrete. 
It follows that © is t-semi-simple unless its component of 1 is commutative. If 
@® is connected its closed invariant subgroups ~© are discrete. Then either 
© = € or the latter is discrete. Furthermore @/€ is ¢-simple and has no 
discrete invariant subgroup #1. Hence G/€ is simple in the usual sense. We 
suppose henceforth that if © is ¢-simple it is also t-semi-simple. 

Now suppose that © is a ¢semi-simple Lie group, % its Lie algebra. & is 
semi-simple. For if tis a commutative ideal, 5* the smallest closed subgroup 
containing Jt is commutative and invariant. The converse that © is t-semi- 
simple if 2 is semi-simple is trivial. If © is ¢-simple (and ¢-semi-simple) & is 
semi-simple and if not simple, 2 = &% @® &'° where &% and & are proper ideals. 
&,; generates G, the component of 1 but since the elements of &% and % commute, 
@, is commutative contrary to assumption. Conversely if % is simple, © is 
t-simple. 

Suppose again that G is ¢-semi-simple and & is an ideal in &, G, the closed 
invariant subgroup determined by & and & the Lie algebra of G,. Since Y 
is an ideal in & it is semi-simple and hence £7; = & @ Mt. The elements of M 
commute with those of %; and hence they generate a subgroup of the center of 
G,. It follows that Mt = 0, gr = &. Hence in this case we have a (1-1) 
correspondence between closed connected invariant subgroups of © and ideals of 
£. Suppose G is connected and 2 = % @ --- @ & , & simple and G; the group 
generated by 2;. Then G@ = GG, --- G,. The component of 1 of D; = 
G; N (Gi --- G-aGiys --- G,) corresponds to 2; M (V+ --- + Va + Bu 
+ --. + %) = 0 and hence consists of the point 1. Thus 9; is discrete and is 
contained in € the center. If € = 1, G is a direct product of simple groups. 
In the general case G/ € is a direct product of simple groups. It has been shown 
by Cartan” that the inner automorphisms (G/€) form the component of 1 in 





13 If de D then the set of elements g- dg, g eG is connected since g — g ‘dg is a con- 
tinuous mapping. Hence g~' dg consists of a single point. i.e. g-'dg = 1. 

14 If S/€ and G are discrete then © is discrete. For € the inverse image of the open set 1 
is openin §. Since € is discrete any point contained in € is also open in § and hence is 
discrete. 

16 Cartan [1], p. 52. 

16 Cartan [4], p. 8. 
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the group of automorphisms %{. Thus this component has no commutative 
invariant subgroup and if G is t-simple the component is simple. 


3, Let t be a complete valued associative ring with an identity” i.e. there is 
defined a real valued function | 2 | for z in R such that 

1. |x| 2 0, = Oif and only if z = 0, 

2. | x | se | —z | ’ 

3. (e+y| Sleltiyl, 

4.|cy|S\|el|lyl, 

5. %is complete in the topology defined by the metric d(x, y) = |x — y|. 

The set of units win R forma group U. Since the product is continuous in R 
by 3 and 4 it is continuous in the subspace U. If |x| < 1 it follows as usual 
thty=1+2+2° + .-.-- exists and (1 — z)y = y(1— 2) =1. Alsofora 
given « > 0 there exists 6 > 0 such that | z| < éimplies| y — 1| <«. Hence 
lis open in R and 2’ is continuous at z = 1. Now suppose N is any open set 
containing a * then aN is open and contains 1 since x — za is a homeomorphism. 
There exists a neighborhood M of 1 such that M~' < aN. Hence N’ = Mais 
an open set about a such that (N’)" < N. Thus zx” is continuous at every 
point and lis a topological group. 

We suppose now that 9 is a simple algebra over R the field of real numbers, 
ie. R = RK, , Cy, or Q, the set of n X n matrices with elements in the real, complex 
or quaternionic fields. If x = (&:;) = >> &,e:;, ec; a matric basis set | x| = 
(> &,8;)*, Then 9 is a complete valued ring.” According as R = R,, C, or 
Q, we write U = L(R, n), L(C, n) or L(Q, n). Using the methods introduced 


by v. Neumann” we can define exp x = 1 + 2 + i + .-- for any z and log 
2 

t= (¢—1)— Ba + for|z—1|< 1. These functions are continuous 

and 


exp(log z) = x if |x—1|<1 
log(exp xz) = x if |2z| < log2. 


Thus the mapping x — exp z is a homeomorphism between the neighborhood 
|z| < log 2 of 0 and a neighborhood M of 1. Since exp x exp(—z) = 1 the 
latterisin U. Itis readily seen that Uisa Lie group. Since 


(exp x) (exp y) = exp (x + y) 


if zy = yx we have (exp x)* = exp 2z and hence the real codrdinates of &; in 
exp x, = (€;,;) are canonical. Hence if N is the image of | x | < 6, , 6: S 3 log 2 
under the mapping x + a = exp z then a has the unique square root a’ = exp $2. 





 Deuring [1], p. 93. 
**von Neumann [1], p.6. The proof given there holds also for Rn and Q, . 
‘*von Neumann [1]. 
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As in §2 we have a* defined topologically-algebraically and a* = exp(az), 
Since 


|expz —expy|=O|z—-y| | log — logy| = O|az — y| 


in our neighborhoods we can prove for b = exp y 


(a%b*)"* = exp E log(exp ax exp wy) — exp (x + y) 


and 


(a*b“a*b*)""** — explz, y] —[z, y] = zy — ya 


if|2|,|y|< 6.” Thus the correspondence z — exp x = a sets up an iso- 


morphism between the Lie algebra of U and the Lie algebra ® in which x + y 
is as in the associative algebra and [z, y] = xy — yz. 

If G is a closed subgroup of U the result of Cartan’s quoted above shows that 
® has a neighborhood of 1 consisting of the matrices exp x where | x | < p and 
x e Xa Lie subalgebra of f. 


4. If U = L(R, n) or L(C, n) it is well known” that the derived groups L’(R, n) 
= 1L,(R, n) the set of matrices of determinant 1 and L’(C, n) = L,(C, n). Since 
det(exp x) = exp(tr x), rz = >> &j if x = (&;) the Lie algebras of L’(R, n) 
and L’(C, n) are respectively the derived algebras R;, and C,. Q, may be 
represented by 2n X 2n complex matrices. We define tr a, det a as the trace 
and determinant in this representation. As is well-known tra, detaeR. Itis 
readily seen that Q’. consists of the elements of trace 0. We show below (§8) 
that det a > 0 for a in L(Q, n) and L’(Q, n) consists of the elements of deter- 
minant 1. Hence we have, as before, that the Lie algebra of L’(Q, n) is Qi. 
R’, , C.(n > 1), Qi, are simple and hence L’(R, n), L’(C, n), L’(Q, n) are t-simple 
Lie groups.” We call these the real, complex and quaternionic unimodular 
groups. 

If S is an automorphism in ” it induces an automorphism in WW’ and in the 
Lie algebra ®t’. If a = exp x by the continuity of S we have a° = exp 2° and 
hence Sin ®’ induces Sin UW’. If Sis an anti-automorphism in R, a > (a*) and 
z — —2* are automorphisms in UW’ and ’ respectively. Since (a*)’ = exp( 2’), 
« — —zx* generates the automorphism in UW’. It has been shown that every 
automorphism in SR’ is of one of these two types. Applying this we obtain the 
following groups 9% of automorphisms for the unimodular groups. 

1. L'(R,n). The automorphism of R, are x > s ‘zs, the anti-automorphisms 





20 Cf. Birkhoff [1], p. 78. 

1 A proof of this theorem is given in §8. 

22 The results on Lie algebras required here and in the rest of this section may be found in 
Jacobson [2], pp. 545-548. 

23 We mean here an automorphism in the algebra &, i.e. (zz)° = xSafor real a. Thisis 
equivalent to the condition that S be a continuous automorphism in the ring R. 
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are 2 8 2's where 2’ is the transposed of x. These give the automorphisms 
a-s asanda—s- (a’)‘s in L'(R, n). The former set form a subgroup = 

L(R, n)/R*, R* the set of matrices al ~ 0 and this subgroup has index | or 2 
pee asn = 2orn>2. Ifniseven R* S L*(R, n) the set of matrices of 
positive determinant and L*(R, n)/R* has index 2 in L(R, n)/R*. Since any 
matrix in L*(R, n) has the form aa , a e R* and a; ¢ L'(R, n) we may choose 
two elements a; , —@ in each class mod R*. Hence L*(R, n)/R* = L’'(R, n)/D 
~ §, D = (1, —1), 3 the group « of inner automorphisms. If n is odd every class 
mod R* contains matrices in L*(R, n) and hence L(R, n)/R* = rin n)/Ri, 
Rt = R* 1 L*(R, n). In this case L*(R, n)/Ri = L,(R, n) = 

2. L'(C, n). Similar considerations show that the meer here are 
as as,a—s (a’)'s,a—s8 ds,a—s ‘(a’)'s, @ = (&;)ifa = (a;;). The 
first set has index 2 or 4 in Y% according as n = 2 or n > 2 and is isomorphic to 
L(C, n)/C*, C* the set ala #O0inC. L(C,n)/C*=L'(C, n)/D & &, D the 
set (1, where ¢” = 

3. L'(Q, n). The automorphism are a > s ‘as,a—s ‘(a’)'s. Ifn = 1 the 
last set is superfluous and if n > 1 the first set has index 2 and is isomorphic to 


L(Q, n)/R* = L'(Q, n)/D = 3, D = (1, —1). 


5. Suppose S is an involutorial anti-automorphism in the associative algebra 
ft, and Us be the set of elements invariant under this automorphism a — (a*)™ of 
W’. Because of the continuity Us is a closed subgroup of UW’. If a = exp z, 
(a°)’ = exp(—2‘*) and | x | is sufficiently small, a = (a*)”* implies 2° = —z. 
Conversely if ° = —2, z in R’ then exp z isin Us. Thus the Lie algebra of 
Usis Ss the derived algebra of Ss the set of S-skew elements. Except for certain 
small values of n (ef. below) the algebras sj are all simple and hence the Us are 
simple. If S, and Se are cogredient anti-automorphisms i in the same § i.e. 
8, = G's,G, @ an automorphism, it is evident that Us, and Us, are equivalent. 
Otherwise for n sufficiently large Us, and Us, are not locally isomorphic. If we 
identify Us, and Us, we find that the automorphisms of this group have the form 
a—a’ where G is an automorphism of commutative with S. Applying these 
principles we obtain the following ¢-simple groups and their automorphisms. 

1. The real orthogonal groups O(R, n, v) consisting of the real matrices a such 
that a's,a = s,, det a = 1 where s, = ey + --- + pp — Cpsipxt — °° 
fn, ¥ = 2p—-n =n, n—2,---,0o0r1 as nis even or odd and n > 8 if even, 
> 5ifodd. The group of automorphisms % consists of the mappings a — bab 
where b satisfies b’s,b = ps,, p ~ 0. Hence Y% is the factor group of these 
matrices with respect to R*. If nis odd p > O since the signatures of b’s,b and s, 
are the same. We may choose in each class mod R* a unique b; such that 
bis, = s, and det bh, = 1. Thus &% = 9% O(R,n,»). If nis even and» ~ 0, 
p> 0 and we may again choose a 6; such that Bed, = s,. However the ele- 
ments of determinant 1 and —1 do not belong to the same class. The former 
form a subgroup of index 2 = O(R, n, v)/D =, D = (1, —1). If n is even and 

» = 0 the representatives may be selected so that either biSob1 = 8 or bi8obi = 
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—s. The former elements having det = 1 form an invariant subgroup of 
index 4 = O(R, n, v)/D, D = (1, —1). 

2. The real symplectic group S(R, 2n): a in R, such that a’ga = q, q’ = ~q 
and n > 2.% Y% consists of a — b-‘ab, b’gb = pq, pe R*. These elements for 
which p > 0 form a subgroup of index 2 = § = S(R, n)/D, D = (1, —1). 

3. The complex orthogonal group O(C, n): a’a = 1, ain L’(C, n) andn>g 
if even, > 5 if odd. % consists of a > bab and a — b ‘ab where b’b = p, 
peC*. The former form a subgroup of index 2 = 3 = O(C, n)/D, D = 1, or = 
(1, —1) according as n is odd or even. 

4. The complex symplectic group S(C, 2n): a’ga = qq’ = —q, ain C, and 
n> 2." 9% consists of a b ‘ab and a — b ‘ab, b’gb = pq, peC*. Y= 3x 
S(C, 2n)/D, D = (1, —1). 

5. The complex unitary, unimodular groups U’(C, n, v): d’s,a = s,, s, andy 
asin 1,aeL/(C,n),n > 2. % consists of a > b ‘ab and a — bab, b’s,b = 
ps,, p¢€R*. The former forms a subgroup of index 2 and unless n is even and 
v = 0, this subgroup = 3 = U’(C, n, v)/D, D the set £1, ¢" = 1. If nis even 
and v = 0, $ has index 4 in Y. 

6. The quaternionic unitary groups U(Q, n, v); ain Q, such that d’s,a = s,, 
s,andvasinln>41.™% Y% consists of a— b ‘ab, b’s,b = ps,,p¢R*. As before 
Y= ¥ = U(Q, n, v)/D, D = (1, —1) unless n is even, vy = 0 when $ has index 2 
in %. 

7. The quaternionic symplectic group S(Q, n): a in Q, such that a’qga = q¢ 
where @’ = —q,n > 4." Y& consists of a— b ‘ab, b’gb = pg, pe R*. The ele- 
ments with p > 0 form a subgroup of index 2 = $ = S(Q, n)/D, D = (1, -1). 

The groups enumerated here are not locally isomorphic to any of the uni- 
modular groups since their Lie algebras are not isomorphic. Altogether these 
ten classes include all ¢-simple Lie groups (relative to local isomorphism) except 
those of dimension 14, 28, 52, 78, 133, 248, 56, 104, 156, 266 and 498. The 
groups O(R, n, n), U'(C, n, n) and U(Q, n, n) in our list are compact. This 
may be seen by noting that the conditions imposed entail the boundedness of 
the codrdinates of these matrices. 

8. Suppose F is any quasi-field, F, the ring of n X n matrices over F and 
L(F, n) the group of units in F,. The method given by Dickson” for the case 
F a finite field may be applied to prove that any element of L(F, n) has the form 
bd,, where b is a product of matrices of the type 1 + 6¢,,, 7 # sandd, = 1+ 
(6 — lem, 6 ¥ 0. 

We have 


(1 + Ber.)(1 + (5 — 1)err)(1 + O€r2) "(1 + (5 — 1)err)* = 1 + (1 — 8) 6ern 


It follows that if F has more than two elements, 6 is a product of commutators. 





*4 These elements all have determinant 1. Cf. v.d. Waerden 1, p. 10 for the cases S(R, 2n) 
and S(C,2n). For the other cases see §8. 
*5 Dickson [1], p. 78. 
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d, is a commutator if 6 is a commutator in F. If F is commutative and det 
a = 1, det d, = 6 = 1 and hence ais a product of commutators. 

This result applied to F = R, = C shows that L’(R, n) = L,(R, n), L'(C, n) = 
L,(C, n) and L*(R, n) consists of the elements bd, with 6 > 0. If F = Q we 
employ the representation by complex matrices and obtain that L’(Q, n) s 
L,(Q,n). Since det b = 1, det d, = N(6) = 8 >0. N(6) = 1 forainL,(Q, n). 
If 6 = 1 it is evidently a commutator and if 6 = —1, iji ‘jy ' = 8 (1, i, j, k the 
quaternion units). If 6 # 1, —1, R(é) is isomorphic to C and hence we have 
tin R(6) so that ¢ = 6, = 1. These exists a uw such that ua = au if a e R(S). 
Hence 6 = ué mw and in all cases 6 is a commutator. It follows that L’(Q, n) 
= L,(Q, n). 

Any element @ in R, C or Q can be joined to 0 by an arc in these fields. It 
follows that the matrices b can be joined by ares in L’(R, n), L’(C, n), L’(Q, n) 
tol. Iféis any element ~ 0in C or Q or 6 > Oin R then it can be joined to 1 
by an are which avoids the point 0. Finally if N(6) = 1 in Q, may be joined by 
an arc consisting of points of norm 1 to 1. It follows that L*(R, n), L(C, n), 
L(Q, n), L'(R, n), L’(C, n) and L’(Q, n) are connected groups. 
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ON A THEOREM OF MARSHALL HALL 
By WILHELM Maanus 


(Received October 1, 1938) 


It is the purpose of this note, to show that a simple proof of the theorem 4.1, 
as stated by Marshall Hall in his paper on “Group-rings and Extensions’ |” 
can be given with the aid of a lemma, proved below, which also might be of 
some interest in itself. 

Let us denote by z, y, --- the generators of a group H, and by , 9, --- , a set 
of free generators of a free group F. By the correspondence i > z, 7 y, --- , 
the free group F is mapped homomorphically onto H; therefore we have F/R 
~ H, where R is a self-conjugate subgroup of F. Let R’ be the commutator— 
subgroup of R, and let us denote R/R’ by A. Then the group G = F/R’ is an 
extension of the abelian group A by the group H. Let Z, gj, - - - be the generators 
of G, corresponding to the generators %, 7, --- of F. Hence we have F >G—H, 
Z—Z-— 2. The group A depends on the number n of generators of H; this 
number will not be the same throughout this paper, and we shall write A(z, y, 
--+) instead of A, whenever this will be necessary to avoid confusion. The 
group A is the direct product of cyclic groups of infinite order. The number 
of the direct factors, or the rank of A, is equal to” 1 + (n — 1)j, if j is the order 
of H. 

Now there holds the following 

LemMA. Let t,, t,, --- be independent parameters, corresponding to x, y, --- 
respectively, and permutable with all the elements of H. Then a true (that is, a one- 
to- one-isomorphic) representation of G by matrices is given by putting 


‘ z 0 ‘ y 0 
(1) z(F ), j-(? 9 ee 


We shall prove this lemma only in the case that H is a group of finite order j. 
If H is abelian, the lemma has been proved before.’ 

It is clear that, by the correspondence (1), a representation of G is given. 
For if ¢(z, y, --- ) = 1 is a relation holding for the generators z, y, --- of H, 
we have 


() 6@,9,--0 (7 4), 





1 Annals of Math. (2), 39, pp. 220-234, 1938. 

*s. O. Schreier, Abhandl. Math. Sem. der Hamburgischen Universitat. 5, p. 179, 1927. 

+s. W. Magnus, Uber den Beweis des Hauptidealsatzes, Journal fir die reine und ange- 
wandte Mathematik 170, pp. 235-240, 1934. 
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where L is a linear function of the parameters ¢, , t, , --- 
(3) L=tAzt+tht+--:, 


the coefficients hz , hy, --- being elements of the group-ring H* of H. There- 
fore the group generated by the matrices (1) has a quotient-group isomorphic 
to H, the self-conjugate subgroup A corresponding to the identity of H being 
the additive group of certain linear forms of type (3). It is clear that the 
representation of G by (1) is a true one if and only if the number of the linearly 


independent forms (3) equals the rank of A(z, y, --- ), for A necessarily is a 
factor-group of A. This may be seen as follows. By definition, the group G 
may be considered as the “most general” group with generators Z, 7, --- , having 


the following property: Whenever ¢,(z, y,--- ) = 1, oz, y,---) = lin A, 
then the elements ¢;(%, 7, --- ) and ¢,(Z, 7, --- ) of G@ are permutable; here the 
expression “most general” means that every group with generators Z, g, --- , 
having this property, is a quotient group of G. Now the group generated by the 
matrices occurring in formula (1) actually has this property. 

For later purposes, we may notice here the relations (using now the equality- 
sign instead of the arrow): 


4) ia 5 ( xt i): gp: ( x yx 4 
* eta? ay? eam —t,ayx+ta+t, 1/° 


’) oF = £°9(4, 9, ---)2 = # 4) 


First, we shall prove the lemma in the special case that all the elements 
u,v,--- Of H, except the identity, are generators of H. In this case we have 


u 0 i v 0 
(5) a=(/ g' p= (? 9 ee 


and, of course, t, = Oifu= 1. By a theorem of Reidemeister,* A(u, v, --- ) is 
generated by the elements 


ag 1 °) 
(6) uw w= ea. 1)? 


and therefore we must prove that there are 

(7) 1+ (j — 25 = (§ — 1) 
linearly independent forms among the functions 

(8) —tu + tw + br, 


(j — 1)’ being the rank of A(u, », ---). Apparently, the (j — 1)” products 
to(u #1, v ¥ 1) are linearly independent, and therefore the same is true for the 
(j — 1)’ functions (8) containing those products. 





‘Abhandl. Math. Sem. Hamburgischen Universitat 5, p. 7, 1927. 
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Now we turn to the investigation of the general case. Suppose the lemma , 
had been proved already if H has exactly k + 1 generators 2, --- , 7%, y, and Fr 
let us assume that the generator y can be eliminated by a relation 











(9) p(x, Pr ae ty = 1. ' 
W 
| Then we have to show: If the group A as defined after (3) has the rank 1 + kj of 
4 for the group generated by the matrices 
‘ wi x; 0 fy 0 (1 
ay} pit (10) X= t 1)” he t 1)’ (¢ =1,---,k), 
\) % zi y Ne 
\, fae then the corresponding group A* defined by the matrices P 
| __(% 9 | 
Ry (11) z= ’ (i =1,--.,h), 
i ts; 1 kr , 
iil? f 
ve ) has at least the rank 1 + (k — 1)j. Now we have . 
i ths = @ (I 
ah METS (12) p(%1, +--+, de) = y 
iia : . L 1 . 
Baha a 
where L is a linear function of ¢,,,---,¢, of the type occurring in (3). 
Therefore (1 
y re . TI 
(13) p(a, ete Ex) nes - + ty 9! de 
This shows that we may pass from A to A* by postulating the relation 
(14) ty vis —Lp'(x, oS in Zr) (2 
for the parameters tz, , --- , tz, , ty, the coefficients of this relation being elements al 
of the group-ring H*. Now there are at most j linearly independent linear th 
forms to be combined from 
(15) ty + Lp “(x1 ed ae Lk) 
by multiplication with elements of H*, for H* contains exactly j linearly inde- a 
pendent elements. Therefore, by adding the equation (14), the rank of A will (2 
be diminished at most by j7. This completes the proof of the lemma.—From 
(4’), we easily can deduce the following TI 
CoroLuary: Given any quotient-group G/C of G, where C is contained in A, 
we can construct a true representation of G/C by matrices of type (1), by postulating (2 
the existence of certain linear relations for the parameters tz , ty , --- , the coefficients 0 
of these relations being elements of the group-ring H*. or 
In proving the theorem 4.1, page 225, of the paper by Marshall Hall, we shall P 





adapt the notations used there. 








ma 


ts 
ar 


ON A THEOREM OF MARSHALL HALL 767 


Let o:(t, y, -:: ) = 1 be a set of relations holding for the generators of H. 
From (1) and (2), we have 


1 0O 
(16) ¢i(, Y,-° ) = (; '), 


where L, = tet; + tyys + ::- ; and xi, Yi, --+ are elements of the group-ring H* 
of H. If h:is an arbitrary element of H* we have by (4’): 


al 1 0 

" ak ia ?: 
Now we define elements &, 7, --- by 

1 0 1 0 
(18) e=(; Di =(; 9 eee 
i, t,, --» being parameters of the same kind ast, ,t,,---. If wis an element 
of H, we have : 

eit sealed 1 0 
" satel &. ) 


and generally, if h is an element of H*, we have 
1 O 
as! es teh 4 


The group formed by the elements £* apparently is “operator free” in the sense 
defined by Hall (cf. p. 223). 
Now we have 


(20) “pe x 7 
ee ye 1)? 


and therefore we will find ¢,(£#, ng, --- ) by substituting 4a + ¢, for ¢, etc. in 
the linear form L,; occurring in the matrix 


=(; i) 
di = Ly 1)’ 


(21) as ee 1 7 
G2, af, ---) = bd + tai + (hy ttyyi t+... 1)" 

Therefore we have 

(22) (EE, ng, ---) = o:(4, 9, -- EM” ---. 


On the other hand, it follows from (16) that [] ¢:‘(2, g, ---) = 1 then and 
only then, if 


(23) DD atihs = Dyihs = --- = 0 


that is: 


5 MRT! 2 BPE = “=2 
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This is the same as 
(24) > arth: = Liyyih: = --- = 0. 


From (22) it follows that rz} = 2;, yy? = yi, --- in the sense 2; > Ys, +> are 
defined by Hall. Therefore >> xh; = >> yshi = --- = O are the necessary 
and sufficient conditions for [] ¢:‘(%, g, --- ) = 1. This is the statement of 
the theorem 4.1. 


FRANKFURT A. M., GERMANY. 
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_ ADDITIVE SET FUNCTIONS ON GROUPS* 
38 i 
es By 8S. BocHNner 


(Received April 1, 1939) 


INTRODUCTION 


All distinctive results of the Lebesgue theory of integration are based on the 
fact that Lebesgue measure is completely (that is countably) additive. Finitely 
additive Jordan volume is not sufficient. The completeness of L,-spaces, the 
Riesz-Fischer theorem, the one-to-one relation between integrable point func- 
tions and absolutely continuous set functions—all these features of the Lebesgue 
theory are absent in the theory of Riemann integration. 

The limitations of Jordan volume and Riemann integral are prevalent in the 
theory of almost periodic functions, whether these functions are given on the 
open line’ or more generally on any group G.” The mean-value of almost 
periodic functions has the properties of a Riemann integral, and following 
Daniell’ and v. Neumann‘ we shall show in Section I that it can be reduced to a 
Jordan volume. However, if the group @ is not compact, it will in general not 
be possible to imbed the Jordan volume in a Lebesgue measure, and for that 
reason theorems of the type of the Riesz-Fischer theorem have no obvious 
analogues. For functions on the open line, Besicovitch has constructed an 
elaborate solution of the Riesz-Fischer problem; however this solution cannot be i 
readily extended to groups in general. ; 

In the present paper we will show that some distinctive properties of Lebesgue 
integrals can be so formulated that their essential parts will remain valid for 
Jordan volume as well, and we will supplement the general statements by special 
results involving mean-values of almost periodic functions and their Jordan 
volume on groups. 

The principal result of section II is a generalization to Jordan volume of the 
theory of Nikodym.’ Nikodym has shown for arbitrary Lebesgue measure that 
an additive set function F(E) which is absolutely continuous is the integral of an 
integrable point function f(z), 








EG FS RE RE BT Se CET 





*A summary of the results has appeared in Proc. Nat. Acad. 25 (1939), pp. 158-160. 
‘For a full account see Besicovitch [2]. 
*See von Neumann [11]. 
* In Daniell [6] a Lebesgue measure is constructed implicitly as the basis for any given | 
| Lebesgue integral. 
r ‘See von Neumann [10]. Here again the resulting measure has Letesgue properties. 
However the construction is aimed at explicitly, and is closer to our own construction 
than the one in Daniell’s paper. ; 
*Compare Saks [15], Chapter I, 13. 
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770 §. BOCHNER 
F(E) = [ f(z) do. 


Any Lebesgue integrable function f(x) is the limit, in the norm / | f(x) | dv, 


of Riemann integrable functions g(x) (for instance, of finitely-valued functions). 
Nikodym’s abovementioned result is equivalent with the statement that in the 
Banach space of additive set functions F(Z) with the norm 


P(E) = sup D | FE) | 


(the sets H,,---,H, being disjoint) the integrals of finitely-valued point 
functions are everywhere dense. In this form, we shall see, the result also holds 
for any Jordan volume. Thus in case of Jordan volume, our Banach space of 
set functions takes the place of the Z,; space, and is isometric to it if the given 
volume is completely additive. Similar generalizations are available for L, 
spaces, p > 1. Our proofs are new, even in the case of Lebesgue measure. 

On the interval 0 S x < 1 the absolute continuity of a point function F(z) of 
bounded variation can be defined in two different ways. The classical definition 
requires that for any finite number of non-overlapping intervals (a, , b,) the sum 


> FO.) — F(a) 


shall tend to 0 as |b, — a,| tends to 0. Another definition which we suggest 
in the present paper is based on the fact that the functions of bounded variation 
represent the functionals on the Banach space C of all (periodic) continuous 
functions g(x) with the norm max|¢g(x)|. We call a function absolutely 
continuous if, uniformly in all continuous functions g(x) for which | g(x) | < 1, 


1 1 
the integral | [ y(x)dF (x) | tends to 0 as [ | g(x) | dx tends to 0. In other 
0 0 


words, a functional on C is called absolutely continuous if in the unit-sphere of 
C it is continuous relative to the norm of the space L;. It is easy to see that in 
the given case the two definitions lead to the same absolutely continuous func- 
tions. In section III we will prove an analogue for general groups (the interval 
0 < x < 1 being the group of real numbers mod 1). 

A. Plessner® has proved that F(x) is absolutely continuous if the (periodic) 
function 





e(h) = [ |d.(F (2 + h) — F(2))| 


is continuous at the origin and hence continuous throughout. We shall make the 
weaker assumption that e(h) is almost periodic (no continuity being postulated) 
and we will prove the assertion under this assumption on general groups. 





6 See Plessner [13]. For generalizations of the theorem in other directions see, ¢.&- 
A. P. Morse [9]. 
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Finally, section IV deals with a theorem of M. Riesz, and section V with 
functions on the open line — © <2 < , 


I. Pornt Functions 


1. Jordan fields 


We consider an arbitrary point set G whose points will be denoted by z, y, ---. 
A Jordan field is a class § of subsets of G whose general element will be denoted 
by £ with the properties: 

1) § contains the empty set; if E « §, then@G — Ee §; 

2) if E, , Ee € v then E,- Es «€ 3, E, + Ere RN 
and a numerical function on § (Jordan volume) which will be denoted by vEZ or 
by| Z| with the properties: 

3)0 S$ vE £1,|0| = 0,|G| = 1;’ 

4) if E,-E, = 0, then | Ei + EB: | = | Hi | + | Ey |. 


2. Partitions 


If E « § then a partition of H—it will be denoted by 6 = 6(Z) = (E,)—is a 
representation of F as a finite sum of mutually exclusive elements E, (v = 1, --- , 
n); the set #, will be called an element of 6. If the set which is to be partitioned 
is not specified, the partition refers to the whole set G. The partition 6 will be 
called regular if | E,| > 0 for each »v. If 6 = (E,) and 6’ = (E,) are two 
partitions of the same element # then we put 6 < 6’(6’ is greater than 4) if each 
E, is contained in some E, and 6 ¥ 6’. A sequence of partitions {6,} is called 
monotone (and regular) if 6, S dn41, n = 1, 2, --- (and each 6, is regular). A 
sequence {6,} is called a closed sequence, if it is monotone and if 6, consists of n 
element E,,,v = 1,---,m. Obviously in a closed sequence the first partition 
4 consists of one element and each 6,4: arises from 6, by subdividing just one 
element of 6, into two and leaving the others unaltered. It is easy to see that 
any monotone sequence can be made into a subsequence of some closed sequence. 


3. Riemann integrals 
Let f(z) be a bounded real function on G. Corresponding to any partition 
6 = (Z,) we form the sums 


M(s) = > sup f(z)-|E,|,  M() = dX inf f(z)-|Z,| 


vy zeky, vy cek, 


and their limits 


M = lim M(8), M = lim M(). 
8 


The limits are formed in accordance with the convention that lim, A(é) exists 
and is equal 1 if corresponding to any ¢ > 0 there exists a d) = 5(e) such that 


"We impose the convenient restriction that the volume of the total set is finite. In our 
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for 5 > 5),|A(d) — 1| < « We shall also admit 1 = + ©, in which case 
A(6) > 1/efor 6 > 6(e). 

Obviously M = M. The function f(x) belongs to the class R (Riemann 
integrable functions) if M = M. The common value of the upper and lower 
integral will be denoted by 


(1) M.f(z) or [ fiz) dv or / f(a) dv. 


The extension to bounded complex-value functions is immediate. A real or 
complex function f(x) belongs to FR if and only if, for 6 = (£,) 
lim 22 | f(x) — f(y) |-|B,| = 0. 
6 v zyeER, 
The class R contains the constants, and is closed with respect to addition, 
multiplication and formation of uniform limits. As a consequence, if f(z), 
g(x) e R and if ¢g(u, v) is uniformly continuous on the set of values of f(x), g(x) 


then g(f(z), g(z)) €R. 
Also by definition of (1), the characteristic functions 


1, zeE 


wp(x) -| 
0, reG—E 


belong to R and f wz(z)dv =| E|. 

The subclass Ro of R shall consist of functions f(x) for which corresponding to 
any « > 0 there exists a partition (#,) such that the oscillation of f(x) is < « 
on each E,. The subclass Ro has all closure properties of R and contains all 
characteristic functions wz(x) and their linear combinations. 

We will also consider the norms 


\f(2) ll = ( [ise 'd) ry 


The space FR if provided with this norm will be denoted by R,. It has all 
properties of a Banach space except completeness. The completions will be 
discussed in later sections. 
Let f(x) « R, and | f(x)| S K, and let 6 = (E,) be any partition of ©. 
Putting 
b, = sup f(z), c, = inf f(z), 


zeky, zeEy, 


1 
(2) = [ fla) do, 


we have | b,| < K, |c,| S$ K,|a,| S$ K. Each of the functions 


g(x) = x bwr, (x), v(x) = DO cwe,(z), 





(3) g(x) = x awe, (x) 
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‘<a step-function having a constant value on each element of 6. Obviously 
g(x) = f(z) => V(x), = ox) = g(x) 2 H(z) 
and hence | f(z) — g(x) | S o(z) — ¥(x). Now 


fine) - 9(2) Pdv s [le—virdv sexy [  - Wao 
= (2K)’"(M(6) — M(8)) 


and therefore we conclude: The step functions are dense in R, ; especially each 
f(z) from R, can be approximated by functions of the type (3), with coefficients 
defined by (2). 

Given 6, we shall denote (3) by fs(x) and call it the projection of f(x) on 6. 


4. Modules of functions 


We shall now invert the construction of Riemann integrable functions. 

TuEoREM 1. Let C be a class of functions {f(x)} on @ each defined everywhere 
and bounded, and let M,f(x) be a number which is defined for all f(x) and let the 
following properties hold: 

1) C contains the function f(x) = 1; 

2) if fieC, foeC and c and cz are constants, then fife eC, cifi + cafe VU; 


Merfi + Cofe) nas aM.fi + CoM wfe ; 
M.f(z) 2 0 f(x) 2 0; 

8) If f. converges uniformly to f, and f, eC, then f eC, and Mf, converges 
to M,f. 

Then, there exists a Jordan field § such that C belongs to the class Ro on that 
field and that 


(4 M.f(z) = [ f(a) do. 


) 

) 
5) Mzl = 1; 
) 

) 


Also the class C is dense in each R, , p = 1. 


Before giving the construction of § we observe that property 8) will not be 
needed immediately but will be used only much later. Also if property 8) is 
hot present to start with it can be easily secured by a closure process. On the 
other hand if property 8) is given, 4) is a consequence of the other properties 


) since | f | is a uniform limit of polynomials in f. 


The construction of § is as follows. Corresponding to any set A (its char- 


| acteristic function will be denoted by w,(z)) we define the outer and inner 


Jordan volumes 
v*A = inf M,f(z), 
vxA = sup M.g(z), 


f(z) P wa(x), 
g(x) Ss w(x). 


f(x) €C, 
g(x) €C, 
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In these definitions we may restrict ourselves to functions f(x) for which f(r) < | 
(and to functions g(x) for which g(x) 2 0). In fact, if f(x) eC and f(z) > 
w(x), then by our assumptions the function 


1 + f(x) — |1 — f(z) | 
2 





g(x) = min (f(x), 1) = 


belongs again to C, is again = w,(x), and M.y(x) S M,f(z). 

Obviously v*A = v*#A. The class § shall consist of those sets A = E for 
which v*A = vxA, and vE = v*E = vxE. Since v*0 = v0 = 0 we have | 0| = 0, 
similarly we find |@| = 1. It follows from assumptions 1), 2), 5) and 6) that 
v*(G — A) = 1 — v«A and v«(G — A) = 1 — 0*A, hence if He F then G — FE  §. 
It also follows that v*(A1 + Az) S v*A; + v*A2 and if Ai- As = 0, ve(Ai + Az) > 
v#Ay + ve Ag . Thus if Ey ’ Ey € AF then EB, + E, € AF and, if E,- E, = 0, then 
| EZ, + E,| = | Hi | + | Fel. 

We next want to show that HE, , HE, « § implies E,-F,¢«§. Now v*E = »4.F 
if and only if corresponding to any 7 > 0 there exist functions f(z), g(x) €C, 
with 1 2 f(z) = ws(x) = g(x) = O such that M.(f(x) — g(x)) S n. Let f,(z), 
gi(x) and fo(x), g2(x) be such pairs of functions for the sets EF, , EZ, respectively. 
Then 


1 2 fi(x)fo(x) = we,.z,(x) 2 gi(x)ge(x) = O. 
fife, 9192 € C and, since 
OS fife — gige S (fr — gfe + gilfe — ge) S (fi — gi) + (fo — 9), 


we obtain M.(fife — gig2) S 1 + » = 2m. This completes the proof of our 
first assertion that § is a Jordan field. 
We next consider a fixed real f(z) «C. We introduce the point sets 


A, = E{f(z) 2a}, -x <a<o, 
the functions 
fa(x) = min {f(z), a} 
which again belong to C, and the numbers 
g(a) = M.f,(z). 
g(a) is monotonely non-decreasing, also g(a) = 0 fora S a = inf f(x), and = 


M,f(x) fora = a, = sup f(z). Fora < b, 


(5) wa,(2) < b(@) = fala) Sw,,(z). 
b—a 


Therefore 


vxAq = M, fete) — fa(%) 


ms h > 0, 
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and hence 
v4A, 2 D*¢(a); 


similarly 
v*A, S D(a). 


If for a given value a, D*g(a) = D™¢(a), then vA, = v*Ag. Thus A, belongs 
to § for a dense set of numbers a. Given 7 > 0, 7 < } we can select from the 
dense set a finite sequence a = b,, vy = 0, 1, --- , nm for which 


&-l=b<bh <---> <b, =a+1 and 
bai — b» < 9. Putting Z, = As,_, — Av, , we have 


sup | f(z) — fly) | S ( — bu) <a, 


zyeERy, 


and this proves our assertion that f(z) «Ro. Also 
(6) Y ba |B S [ fle) do s Lb | Bh. 
v= G y=] 
On the other hand, 
elon) = ofby), = 5 — 9 @, _ 4. 








y=] b, — b,-1 
and by (5), 
| Ay, | s 202 — eb) < 14, | 
{ b, —_ by-1 4 
and hence 


D by-s|Bs| + bas, S o(bs) + (boAry — (be); SD br | By| + be An, 
Since Ay, = 1, y(bo) = bo, Av, = 0, we finally have 
> be1|Br| S olbs) = Mafla) < Db, | Bl 
Comparing this with (6) we obtain (4). 
The last statement of theorem 1 will be satisfied if we show that for any E « §, 


| | f(x) — we(x) |? dv can be made arbitrarily small for an appropriate choice of 


f(z) eC. Assuming 1 = f(x) = w,(zx) the integral is < / (f(z) — wr(x)) dv = 


M.f(x) — | E| and this can indeed be made as small as we please. Any class 
of functions having the properties of theorem 1 will be called a module, and 
the Jordan field § whose construction we have just described will be called a 
generated Jordan field and it will be denoted explicitly by &c . 


ce 
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5. Lebesgue fields 


A Lebesgue field is a Jordan field having the following properties: (i) If E, ¢ §, 
y = 1, 2,---, then EF, + E, + --- e&, (ii) If E, converges monotonely to E, 
then | E, | converges to | E | and (iii) Any subset of a set of volume 0 is again an 
element of the field. 

TuroreM 2. A generated Jordan field §c can be extended to a Lebesgue field 
if and only if for any sequence {f,(x)} C C, the assumptions 


(7) lim f(z) =0, (|fa(x)| SK 
emply 
(8) lim M,.f,(x) = 0. 


In fact if § is part of a Lebesgue field then M,f(x) is a Lebesgue integral and 
(8) is a consequence of (7). Conversely it is known that § can be extended to a 
Lebesgue field if the assumptions 


imply 
(10) lim | Z,| = 0. 


Now corresponding to each EF, we pick a function f,(2) «eC such that 0 < 
fr(z) S wez,(x) and | Z,| — M,f,(x) — 0. Obviously (9) implies (7), this 
implies (8), and this again implies (10). 

An an illustration consider the module C consisting of all continuous almost 
periodic functions on the line —» < x < o with the mean value 


M.f(a) = lim ef §(0 at 


say. This module will be called the module of Bohr functions. The sequence 


2 
f,(z) = (sin g) converges to 0 at every point, and yet 


n 
a\? 1 — cos “ 1 
M,(sin-]}] = M,———_—_ = =. 
(sin *) i 
Thus the Jordan field generated by C cannot be extended to a Lebesgue field. 
Our module C can also be used to illustrate another impossibility. Let g(x), 
h(x) eC, and let w(x) be any function on —~ < x < & which is Lebesque 
integrable in the ordinary sense, such that g(x) S w(x) < h(x). Obviously 


1 - He 1 T 
i M,o(z) < lim | sin 2: < : 
(11) g(x) <= im aT po) dt = lim oT od) dt < M,h(z) 





8 See for instance Jessen [7]. 
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Now, if f(x) eC, then for any a, the point set A, = E{f(t) = a} is measurable 
in the ordinary Lebesgue measure. Hence, if for a special value of a, Aq « §, 
then, by (11), 


SiS oe we 
|Aa| = lim +7 [o..@) dz. 


But it was shown by H. Bohr that the latter limit need not exist for every a 
if f(z) is an arbitrary almost periodic function.” Therefore it was essential in 
our proof of theorem 1 to argue the existence of a dense set of numbers {a}, for 
which E, ¢ §; the property does not necessarily hold for every a. 


II. Spaces or Set Functions 
6. Definitions 


Apart from point functions we shall also consider set function. Every set 
function F = F(#) will be defined on all sets E of some Jordan field § and it 
will be finitely additive and of bounded variation. Point functions will be 
denoted by small letters, set functions by capital letters. If a set function is 
the indefinite integral of a point function, it will be denoted by the same letter. 
Thus if f(z) e R, then its indefinite integral will be denoted by F(E£), and so 
we have 


F(E) = [ f(a) do. 


Similarly the indefinite integral of f; , ¢, gn will be denoted by F; ,,G,. Also 
we shall say that the set function F = F(E£) belongs to R, Ry , R, , C ete. if it is 
the indefinite integral of a point function belonging to R, Ry, R,, C ete. But 
we wish to stress that unless otherwise stated a symbol like F(#) will denote 
a set function which need not be an indefinite integral. 

If | Ey | > 0, F(EB) is said to be constant on E, , if for E C Ey 


F(E) _ F(Ey) 
|E| =| Bo 


(which implies that F(Z) = Oif | Z| = 0). The common value of the fractions 
(12) will be called the “value” of F(Z) on Ey. If 6 = (E,) is a regular parti- 
tion of G, then F is called a step function on 6 if it is constant on each E,. 
Also if a, are its values on E, , then F(Z) is the indefinite integral of _ Aw x, (2). 





(12) 


In particular if F(Z) is any set function and 6 = (Z,) is any regular partition, 
F(E,) 
= |B, | bie Bal 
jection of F(E) on 6 and will be denoted by F;(EZ). Projections will be an 
important means in our study of set functions. 





then the step-function with the values a, = on 6 will be called the pro- 





*See Bohr [5]. 
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7. Norms 
Given p = 1, we consider for any F(#) and any 6 = (E,) the expression 


| F(E,) |? 1/p 
(13) A(6) = (x B et) ; 


If p > 1, in order to avoid unnecessary complications we shall form (13) only 
for such functions F(Z) for which F(Z) = 0 if | Hy | = 0, and we agree that 
for | EZ, | = 0 the term | F(Z,) |’/| Z, |?” in (13) shall have the value 0. A(s) 
is monotone in 6 that is A(6) < A(6’) if 6 < 6’; therefore it approaches a limit 
as 6 increases; also the expression 


|| F ll =F lip = tim A@) 


has the properties of a Banach norm. Moreover by our definition of the pro- 
jection F;, A(é) = || Fs ||, and thus 


| F ll = tim [| Ps IL 


The resulting complete” Banach space will be denoted by V,, p = 1. The 
space V, has a complicated structure; we will be interested also in its closed 
subspace AC (space of absolutely continuous functions). A function F(E) 
belongs to AC if there exists a function n(e), lim.o n(e) = 0, such that | Z| < 
n(e) implies | F(Z) | S e«. We further add the Banach space V., with the norm 


| F(é) | 
|B] 





IF Il = 11 F lle = sup (|E| #0). 


As a rule we will assume p ¥ o. 


If Fe V,, p > 1, or Fe AC, then there exists a monotone sequence {6,} 


such that 
(14) || F || = lim || Fs, ||. 
Now, we may assume that the sequence is closed and regular. In fact let (5, } 


be any monotone sequence for which (14) holds. Let Eo be any element of 
5,1 and let | Eo | > 0. There are elements of 6, whose sum is Ey, we shall 


denote them by E,, --- , Z,. We assume that | Z,| > 0,---,|#,| > 0 and 
| Zuit| = --» = |Hy| = 0. Replacing in 6, the sets Z,, --- , Ey by the sets 
Ei, Ee, +++ ,Bya, Ey + Eyst, + --- + Ey we will obviously not alter the 
value of || Fs; ||. Carrying out this contraction for successive values of 
n = 1, 2,--- and all sets Ey we obtain a new monotone sequence {8,}. It is 


regular and it satisfies (14). Finally the latter sequence can be made the 
subsequence of a sequence {6,} which is also closed. 


10 It is not hard to conclude by the usual reasoning that any Cauchy sequence is con- 
vergent towards a limit element. 
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Obviously, Vp, p > 1, contains Cl(R,), (closure of R,), and AC contains 
Cl(R,). We shall prove that these pairs of spaces are actually identical. A 
special proof will be used for the spaces V,, although the method which will 
apply for the space AC could also be used in the other cases. 


8. An inequality for p > 1 


For p > 1 and e > O there exists a finite number K(p, e) such that for all 
real numbers f, 


(15) lf-1P S Kp, MSP? +p-—1—pf)t+ els. 


In fact, the function A(f) = | f |? + p — 1 — pf” is continuousin —» <f < « 

and > 0 for f ¥ 1, and |f — 1|? = O((f/)) as |f| > ©. Thus|f-—1P s 

M(p, »)-(f) for |f — 1| > 2, 7 > 0. Now given any e > 0 there exists an 

» > 0, such that | f — 1| S n implies |f — 1)? S e|f|?, and this proves (15). 
THEOREM 3. For p > 1 and « > O the inequality 


(16) lf — fol? S K(p, IFIP — [fe lP) + lS II 


holds for any f(x) « R and any partition 6 = (E,). 
Let f(z) be any real function from R, and let E be any set for which 


| f(z)dv = |E|. Putting f = f(x) in (15) and integrating over EF we obtain 
Eg 


am) [1 fle) — 1 dos KO, 0 ff |e) I do — | B|) +6 f 1 s0) P ae 


Hence we obtain for any f(x) in R, 


[ ine) — a P dos K(p,0)(f 142) Pao — ja? | Bl) + ef 1ste) Pa, 
where 


ale i 
a= iE] Se) dv. 
Putting H = E,, and adding for all » we obtain (16). 
TuroreM 4. For p > 1 and e > O the inequality 
(18) ||F — Fel = K(p, (I FIP — || Pol) + ell FIP 


holds for every Fe V,. 
For every 6, F; is a step function in R, , and for 4’ > 6, (Fx)s = Fs. There- 
fore, by theorem 3, 


(19) || Fo — Fs |? S K(p, ¢)(\| Fo |I? — || PoP) + € I] Pe IP 
for &’ > 6. Taking the limit with respect to 6’, || Fs || tends to || F || and 
Fs — Fy || = || (F — Fy)s- || tends to || F — F; ||, and (18) follows from (19). 





" This function has been introduced in F. Riesz [14]. 
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9. The structure of V,, p > 1 


As an immediate consequence of theorem 4 we have 

TuroremM 5. If FeV,, p > 1, and {6n} ts a sequence of partitions, then 
|| Fs, || — || F || amplies || F — Fs, || — 0. 

Since each F; is a step function we hence obtain 

TuroreM 6. The step-functions are dense in V,,p > 1. Therefore, V, = 
Cl (R,), p > 1. 

On the basis of theorems 6 and 5 various properties of ordinary L, spaces 
can be established. We shall collect them into one theorem, and omit the 
proofs. 

THEOREM 7. The spaces V, and V,,p >1,¢ > 1, p = q/q — 1, are conju- 
gate. Corresponding to any functional X(F) on V, there exists an element G ¢ V,, 
such that 


X(F) = lim M.(fs(x)gs(x)) 


and vice versa (the projections F;(E), Gs(E) being the indefinite integrals of the 
point functions f(x), gs(x)), and the norm of X(F) is || G||q. 
A sequence {F,}, F, ¢ V, is weakly convergent if || F,|| S K and 


lim F,(£) y © 


exists for each E, the limit defining the limit element. Also, if {F,} converges 
weakly to 0, then for a suitable subsequence F, 


“vil — fO(m"?) if 1<ps2 
ZF a ~ \0(m') if pz 2.” 











Finally, if V, is separable it has a basis consisting of step functions.” 


10. A mapping process"* 


Let {6,} be any closed regular sequence of partitions of @. Denoting the 
elements of 6, by Z,,, v = 1,---,n, we may assume that for each n there 
exists a uw such that En.1, = En, forvy = 1,---, mu, Engi + Ensue = 
Ens and Ens, = Enya for v = pw + 3,---,n + 1. We now form the 
numbers 


(20) to=0, tr=D|E,|, v=1,---,n, n=1,2,---, 
p=1 


and with these numbers the linear intervals 


ta ,»—1 < t < ta 





12 The proof in Banach [1], p. 197-9 is immediately applicable to set functions which are 
integrals of point function and, by approximation, to set functions in general. 

18 The proof of this statement will be included in another publication. 
44This process has been applied repeatedly to measure in product spaces. 
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which we denote by én». Obviously for each n the set of intervals (e,,), 
»=1,-+-,, is a partition of the linear interval 0 S ¢ < 1,—we will denote it 
by 6,()—, and the sequence {6,(t)} is a closed regular sequence of such parti- 
tions. Also 


| Ens | = | Cn» | —_ ta» ine? tari . 


The set of numbers (20) will be denoted by 75 , its compact closure by 7’. Any 
interval whose endpoints are points of 7’ will be denoted by 7 (with a sub- 
sript). If 7 is not the whole interval 0 S ¢ S 1, let its complement consist 
of the (open) intervals {7,}, v = 1, 2,---. Any interval which after omission 
of its left end point is a proper part of an interval 7, will be denoted by 7* (with 
asubseript). An arbitrary interval e of 0 S ¢ < 1 is the sum of an interval 7 
and an interval 7*. 

Now let F(Z) be an arbitrary absolutely continuous function. Our mapping 
process gives rise to a function F(Z) on 0 S ¢ < 1. The latter function is 
defined at first on the intervals whose end points are points of 7). Since F(E) 
is absolutely continuous, F(e) varies continuously with the end points of e, 
and thus we can extend F(e) onto all intervals 7. In particular we can define 
it for all intervals {7,}. Finally we define it on the parts 7* of 7, by linear 
extension 

ror) =|"! pq). 
| % | 
After additive extension the resulting function F(£) exists on all intervals e 
of 0 $¢< 1. Our final step is to show that F(e) is again absolutely con- 
tinuous. We have to prove that existence of a function n(e), lim.o n(e) = 0, 
such that 


a1) D1 Fd | <9(2 lel) 


x=] x=] 


on the assumption that there exists a function mo(e), limsso mo(€) = 0 for which 


l l 
(22) »» | F(E) | < no (= | Ey ) 


the sets (e,) and the sets (EZ) being disjoint. Since subdivision of the intervals 
é, does not decrease the left side of (21), we may assume for the sake of the proof 
that either 1) all e, are intervals 7, , or 2) all e, are intervals i, . In the first 
case we may obviously put n(e) = m(e). In the second case we choose n so 
large that Zetia! i,| < ¢/2. For those 7* which lie in > %.,% we have 
LIFG)| < 42, | FG) | < m(¢/2). The number n being fixed and F(e) 
being co on each 7, we can obviously find an m(¢/2) such that for 
rk C 1% 


DIP} <m (Ql el) 
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This completes the proof of our statement. 


11. The structure of AC 


TuroreM 8. If F(E) « AC, and if {5,} ts a closed regular sequence of parti- 
tions, then 


lim || F3,, — Fs, || = 0, m,n — o, 


The proof will require several steps. 

We shall temporarily call a function G(Z) a step function on {6,} if it is a 
step function on some 4, (v sufficiently large). Our theorem is trivially true 
for such functions. Therefore it is also true for functions which are limits in 
the norm of such functions, since, 


|| Pon — Fon Il S|] FP — Gan |] + [1 Gon — Ga, I] + ILC — @)a, II 
= 2||F —G@|| + ||G%, — G, ||. 


We next apply our mapping process, as described in §10, relative to the 
given sequence {é,} and we thus obtain, on the interval 0 S ¢ < 1, a closed 
sequence {6,} = {6,(¢)} and a function F(e). Clearly, || F(e) || < || F(Z) ||, 
the equality not holding in general. The equality does hold, if F(Z) is a step 
function on {6,}, and therefore 


|| Fs,,(Z) — Fs,(E) ll = || F;,,(e) — Fi, (e) I. 


Combining this with our first step, we see that our theorem will be proved if 
we show that our function F(e) is the limit in norm of functions G(e) which 
are step functions on {6,(é)}. 

Now by the theory of Lebesgue, since F(e) is absolutely continuous there 
exists, for given « > 0, some step function G(e) for which 


(23) || Fe) — Ge) || <«. 


If 6(t) = (e,) is the partition on which G(e) is a step function, we may assume 
that any e, is either an interval 7 or aninterval7*. If e, is an interval 7* lying 
in an interval 7, , then there must be other intervals e, whose sum is the given i, . 
Now F(e) is constant on 7,. Hence replacing G(e) on 7, by the constant value 
of F(e) will lead to a new step function G(e) for which (23) holds too. Carrying 
out this adjustment for all intervals 7*, we will obtain a step function G(e) for 
which the intervals of constancy are intervals 7. Slight variations of the inter- 
vals will make them into intervals whose end points are points of J). Thus 
(23) can be satisfied by a function G(e) which is a step function on 6,(¢). 

This completes the proof of our theorem. 

TurorrEM 9. The step functions are dense in AC, thus AC = Cl (Ri). 

Suppose the theorem were false, and let F(Z) be a function which cannot be 
approximated by step functions. Then there exists a positive constant \ such 
that for each regular partition 6, || F — F;|| > ». Since 


lim || Fs: — Fe || = lim | (F — Fads || = || F — Fs Il, 
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| 
if 
| 


Fy — Fy|| >. \ aa 
In this way we could construct a closed regular sequence for which theorem 8 | 





if 6is given, we can find a regular partition 5’ > 6 such that | 











does not hold. 
parti- As a consequence of these theorems the following theorem could be proved q 
TaeorEM 10. The conjugate space to AC is Vn. If Ge Va, the functional i 
+ © it represents can be written in the form . 
| X(F) = lim [ fi(2) ae. ! 
tisa 3 
; Me A sequence {F,} 1s weakly convergent in AC if the norms || F, || are bounded 
=e and the functions {F,} are uniformly absolutely continuous, and if the limit of ; 
F,(E) exists for each E. i 
If AC is separable it has a basis consisting of step function. it 
iI 
Gy ||. 12. Functions of bounded variation on generated fields ) 
‘hs Let § be a Jordan field which is generated by a module C. The norm ; 
B) || II 1] = sup | f@) | 
, step P 
makes C into a Banach space, its conjugate space will be denoted by Vc. The | 
latter space arises from V, by identification of some of the elements and con- i 
dif traction of the norm. In fact, if G e Vi , then the expression : 
red i 
vhich (24) x(f) = | #(2) aa 
there represents a functional on C, that is an element of V, and its norm is ‘| 
i 
IG lle = sup| fe) aa sll <1. | 
vying Obviously, || G ||¢ S || @||:. Conversely, any element of V¢ can be so repre- ii 
ty sented by at least one element G of V;. In order to find such an element we 
ili extend the functional, without increasing its norm, from the given space C' to 
-ying the larger space Ro , and in the latter space every functional can be represented 
) for by a Stieltjes integral (24) the function G(E) being defined by 
nter- G(E) = Xz). 
Thus In order to avoid the introduction of new symbols we shall denote the elements 
of V; in the same way as the element of Vi. Elements of V; will be called 
equivalent if they represent the same element of AC; equivalence will be denoted 
by of 
4 The element J(Z) = | E | of V; or its equivalents will be called the cdentity 
8 in Ve ° 


The subspace of V¢ which corresponds to the subspace AC of V:—we shall 
denote the new subspace by V4c—is the closure of the set of elements G ¢ Vc 
which are indefinite integrals of elements g eC. It can be shown that an ele- 
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ment G of V¢ belongs to Vac if corresponding to any ¢ > 0 there exists a n(c), 
lim..o n(€) = 0, such that for | f(x)| S$ 1, 


| { 12) a@| = of | 1102) | av), 


More remarkable properties of the spaces V¢ and V4¢ will be established for 
modules on groups. 


III. Ser Functions on Groups 
13. Group invariant modules 


If & is a group, and if C consists of almost periodic functions, it is appropriate 
to assume that C is group invariant; that is if f(z) belongs to C then f(z") 
belongs to C, and if a is an arbitrary element of G, then the functions g(z) = 
f(za), h(x) = f(ax) again belong to C. The generated Jordan field is also 
group invariant; if E belongs to § then the sets EF’, Ea (right-translation of E 
by a) and aE (left translation of E by a) also belong to F, and | FE’ | = | Ea| = 
|aH| = |E|. 

By our definition of equivalence, two elements F ¢ V; are equivalent, if the 


integrals / g(a) dF have the same value for all geC. In particular, F ~ 0, 


if these integrals are all 0. For general C, it would not be easy to decide, 
whether “equivalence’’ implies identical equality, even if we assume that the 
elements F belong to AC. But for group invariant modules this is always 
the case. 


THEOREM 11. If C is group invariant, if F « AC and if / g(x) dF = 0 for 


all g eC, then F(E) = 0. 

The proof will require several steps. 

1) If f(x) eC, then corresponding to any e > 0 there exists another function 
g(x) eC with the following properties: (i) g(x) = 0, (ii) M.g(x) = 1, and (iii) 
putting h(x) = Myg(zy™)f(y), then 


(25) lf —hlh Se. 


In fact the function g(x) can be chosen to satisfy the stronger relation 
lf—hl se” 
2) If F(Z) « AC and g(x) eC, the function 


(26) n(x) = [ (ay) ayF 
again belongs to C, and 


(27) I| All S (Mz | g(x) |)-|| F |. 





18 See von Neuman n [11], p. 463, theorem 17. 
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In fact, if F’ approximates to F in norm then the function / g(xy") d,F’ 


approximates uniformly to (26), and therefore it is sufficient to prove (27) for 
functions F which are indefinite integrals of functions f eC. But our statement 


27) is trivial for h(a) = | g(ayfly) do. 


3) If F(Z) « AC, then corresponding to any e« > 0 there exists a function 
geC with the properties: (i) g(x) 2 0, (ii) M.g(x) = 1, and (iii) denoting the 
indefinite integral of (26) by H(E), then 


(28) |F-Alhi Ss. 


In fact let F’ be the indefinite integral of a function f’ e C such that || F’ — F ||, Ss 
¢/3, and let, by 1), g(x) be such that g(x) = 0, M.g(x) = 1, and 


[ire - Mae) lav < 3. 


This leads to relation (28) if we apply 2) to the function F — F’ and the 
given g(x). 

Now if | g(x) dF = 0 for all g € C, then (26) vanishes identically, and there- 
fore, by (28), || F |l1 S « Hence F = 0. 


14. Orthogonal systems 


Let C be an arbitrary module and {ga(x)} an arbitrary (not necessarily 
countable) orthonormal system of elements from C."* Every f ¢ C has a Fourier 
expansion 


fla) ~ Leavals), ca = | aala)fla) de, 


and so has every element F ¢ V; , namely 


FB) ~ Licaeelt), ca = | Belz) AP. 


The Fourier series of F(Z) should be denoted more adequately by 
Do Caa(E) 


the set function @,(Z) being the indefinite integral of ¢a(z). 

On the one hand, if a sequence of functions F(Z) are convergent in norm, then 
the Fourier coefficients c, are convergent (individually) for each a. On the 
other hand, if f(x) ¢ Re, then by Bessel’s inequality, only a countable number 


——— 


The properties of modules of almost periodic functions to which we will refer are 


Tor Seer. 
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of Fourier coefficients is ~ 0, and their values tend to zero in any arrangement, 
Thus, by Theorem 9, we have 

TuHroreM 12. If F(E) «AC, then only a countable number of Fourier coeff- 
cients are # 0, (and they tend to 0 in every order). 

Counter Exampie. The theorem is no longer true, if F(#) « V;, even if C 
is group invariant. Before constructing a counter example we observe that if 
{~a(x)} is an orthogonal system which is closed in C, then a formal series 


ye Ca Ya (x) 


is the Fourier series of an element G(Z) from V;, with norm < M, if and only 
if the relation 


(29) | Dd fade | < M sup | f(z) | 
holds for any finite sum 
f(x) = bs da Pax). 


In fact, if co = | cate) dG(E), then 


|Desae| =| f se) a@| < || l-sup | fo) | 


Conversely if (29) holds, then the linear functional 


X(f) = D tade 


can be extended onto the whole space C and put in the form X(f) = / f(z) dG. 


Hence, 


ta = X(v-) = | gala) dQ. 


Now let C be the module of Bohr functions and ga(z) = e’. If K(z) isa 
function of bounded variation in —» < x < o, then the expression 


X(f) = [1@ dK (2) fe 


obviously defines an element of Vc , and for its representatives in V; the Fourier 
coefficients are given by 


Ca = [ e ** dK(z). 


If so defined, c, is continuous in a and therefore it is ¥ 0 for a non-ennumerable 
number of values a. 
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The theorem which follows may be compared with v. Neumann’s theorems 
on the uniqueness of Haar measure.” 

TarorEM 13. Let C be group invariant. If an element F(E) of Vc is right 
invariant that is if F(Ea) = F(E) for every a e & (or if it is left-invariant), then 


ment, 15. The Riesz-Fischer theorem 1) 
Admitting set functions F ¢ V2, it is trivial that >> | a, |? < © implies the ; 

coeff existence of a function F such that 
4 

nif C F wD anga(z). 
4 

hat if If we apply this to fields which are generated by group invariant modules C we : 
S obtain a partial generalization of the construction of Besicovitch to almost : 
periodic functions on general groups.” . 

only 16. A uniqueness theorem 
j 





it is a constant multiple of the identity. 
The proof is quite simple. By assumption, for every element g(x) ¢C, : 
| o(2) arce = f 92) area) = f (2a) are). | 

By definition, M,g(z) is the uniform limit of the function \ 
A g(xa;") | 

N v=1 i 7 





for appropriate choices of the elements a,. Hence we conclude 


[ot ar = [ (ou) aP, i 
) dG. , 
or, putting \ = / dF = F(®), 
[o(e) arc) = xMzg(z) =» [ 9(2) aot). 
si Therefore F(Z) ~ \-vE = | £ |, and this was our assertion. 
Our theorem can be generalized considerably. We consider the function 
ve T(a) = F(Ea*) which is defined for a e G and whose value is the element of Vc 
. as represented by F(Ea’). Theorem 13 assumes that 7'(a) is constant. We 
shall now consider the more general assumption that 7'(a) is almost periodic. 
rier Since the space V¢ is metric, many properties of numerical almost periodic 
functions carry over to the function 7'(a)."” The major property which we will 
consider is that it has a Fourier series by which it is uniquely determined. If 
(30) {Gpo(X) } p,omi,+++48 
ible '' A detailed discussion will be given in Section V. 





8 See von Neumann [10] and [12]. 
? See Bochner-von Neumann [4]. 
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denotes an irreducible representation of © of dimension s consisting of almost 
periodic functions, then there exists a system of elements F,,(E) all belonging 
to V; and hence to Vc, such that the sum 


(31) s y Ppo(@) F pe(E) 


is the contribution of the matrix (30) towards the Fourier series of T(a). We 
now have the following theorem. 

THEOREM 14. Let C be group invariant. If an element F(E) € Vc is such 
that the function 


(32) T(a) = F(Ea™*) 


is almost periodic, if the functions ¢,.(x) of the irreducible representation (30) 
belong to C, and if (31) is the contribution of (30) to the Fourier series of (32), then 
there exist numbers yo such that 


(33) F ,.(E) = > er | Pro(x) dv 


for p,o = 1,---,s. If the functions {y.(x)} do not belong to C then the corre- 
sponding elements F,6(E) are equivalent to 0. 

Obviously, if F(Za™) is constant, and g(x) = 1 is the identical representation 
of the group, then the term of 7(a) corresponding to that representation is 
g(a)-F(Z) = F(E), and hence we conclude from theorem 13 that F(E) ~ 


d / 1-dv = | E |, which is the assertion of theorem 13 for this case. 
E 


For the proof of (33) we use the fact that F,,(Z) is the limit, in the norm 
of V~, of sums 


= 3 F (Ea;") o0(ay) 


for appropriate elements a,. Hence, for a fixed element g(x) eC, / g(x) dF ,o(E) 


is the limit of 
(: p> 0) (te) AF CE). 


The constants a, can be chosen in such a way that the integrand approximates 
uniformly in z, to 


GJoo(x) = M,(g(xa)¢,.(a)). 


Hence we obtain 


| oe) av.) = f g,e(2) aF). 
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Putting za = b, we obtain 


drol2) = Mol) rel2 0) = Do on(z) Mu(Q00)00(0). 
Hence, defining 
her = f @ee(0™) dP (E), 


we will have 
/ g(x) dF,.(E) = x Nor / g(x)Pro(x) dv, 


for all g(x) « C; but this implies (33). 
If v,.(z) does not belong to C, then 


/ g(x) Gre(x) dv 


vanishes, and therefore, F,.(E) ~ 0. 


17. A criterion for absolute continuity 


From the last theorem we can easily conclude 

THEOREM 15. Let C be group invariant. If an element F(E)«€ Vc is such 
that the function (32) ts almost periodic, then F(E) ¢ Vac. 

Since (32) is almost periodic it can be approximated (uniformly in a) by 
finite linear combinations of expressions (31) with numerical coefficients. 
Putting a = 1, we see that F(£) is the limit, in the norm of Vc, of finite linear 
combinations of elements F,,.(Z) with constant coefficients. But, by the last 
theorem, F,,(#) is the indefinite integral of an element from C. 


IV. GENERALIZATION OF A THEOREM OF M. RIESz 


18. Symmetric orthonormal systems 


Let f(z) be a real integrable function of one variable in 0 S x < 2m, and 
let its Fourier series be denoted by 


a + > (a,e"* + ae”). 
Let f(x) be the function whose Fourier series is 
—1 } (a,e"* — a_,e””) 
provided the function exists, and let f*(z) be the function f + if whose Fourier 
Series is 


a +2>, a,c". 


y=l 


j 
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The theorem of Riesz we are interested in states that if f(z) belongs to the 
Lebesgue class L,, p > 1, then the function f(z) (and hence f*(z)) also exists 
and is again a function of L, . 

The known proofs of the theorem are based on properties of analytic func- 
tions of a complex variable, the function 


a + 2 > a, eur 
being analytic in the half plane y < 0.” In the present section we shall outline 
a proof which is not so restricted, and which applies to orthonormal systems 
other than {e”’}. 

We assume that in an arbitrary module C there exists a system {¢,(z)} 
which is orthonormal on the generated field § and has the following properties. 

1) One of the given functions is the constant 1, and it will be denoted by 
go(x). Thus g(x) = 1. 

2) The other functions fall into two families. If a function g.(x) belongs to 
the first family we shall write symbolically a > 0; if it belongs to the second 
family we shall put a < 0. Corresponding to any element ¢,(x) of each of the 
two families there exists precisely one element of the other family, which will 
also be denoted g_,(x), such that 


g(x) = y,(z). . 


3) Finally, and this is the crucial property, if g(x) and ¢,(x) belong to the 
same family, then their product ¢)(x)-¢,(x) is a finite linear combination 


> Cy Dv (x) ’ 


with constant coefficients c,, of elements ¢,(x) belonging to the same family; 
or, more generally, it is the uniform limit of such combinations. 

An orthonormal system with these properties will be called a symmetric 
system. 

Obviously the system {e'"*} is symmetric, an element belonging to one family 
if n is positive, and to the other family if n is negative, since "oo ail 
e™*"* The exponent n may be an integer as in the case of periodic functions, 
or a dense or continuous parameter as in the case of almost periodic functions. 

We can also consider the case of several variables, say e°’"*"™. We obtain 
two families if we take in the (m, n)-plane any fixed line passing through the 
origin, say pm + on = 0, and if we put into one family those pairs (m, n) for 
which pm + on > 0 and into the other family those for which pm + on < 0. 
The pairs which lie on the line itself, form themselves a one-dimensional or- 
thogonal system, and fall into two sub-families as described above, excepting 
always the pair (0, 0). Adding each sub-family to one of the larger families, 
will make the total orthonormal system symmetric. The pairs (m, n) may 


inz 








20 Compare Zygmund [18], p. 192. 
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again be any two-dimensional lattice which is representative of a module of 
(almost) periodic functions in two variables. 
The extension of the construction to more than two variables is immediate. 


19 


We shall now formulate two theorems. 
TueorEM 16. If {ga(x)} ts a symmetric orthonormal system, and if {aq} is a 
system of numbers of which only a finite number are ¥ 0, then for p = 2 


oe |Z aeeell> $ Apll D aavell 


the constant A, depending exclusively on p. 
If for q > 2, the system {ga} is closed in R, , then (34) also holds for p = q/q — 1. 
Denoting by A, an absolute constant which is not always the same we easily 
see that relation (34) implies and is implied by a similar relation in which 
a> Ois replaced bya £0,ora <0,ora 30. Putting 


(35) f(x) = aogo + > (Gaga + d—a9-a) 


then 


f+f=a+a+ 2 (aa + a-a)ee + (aa + da)p—a) 


36 es a dan 
” f—f =a — G+ DY ((da — a_a)a + (aa — Ga) ¢-a): 


a>0 


Thus, by (34), 


(37) | (az + a_a)¢a|| S Ap|lf +f \| S 2AoIIF lp 
(38) | (az — a_a)e-a|| S Ap|lf —f\| S 2Ae lf lo- 


Conversely, (37) and (38) implies (34). Therefore it is sufficient to prove our 
theorem for real functions (35). In this case we can write 


f = do + » (dava + da ~—a) (ao real). 


Introducing the functions 


f= —i 2) (dave on ap—a) 
fr=f+if=m+2 2 dope 
Ss = Do tava 


we have 


2fx = f* — a, 


ii 
: 
' 
: 
' 
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and since (34) is equivalent with || fx ||, S Ap||f|lp and |q| < 
| \seeot2) |dv < ||f |p, it is sufficient to show that || f*||, S A,||f|I,, 
and this again is equivalent with 

(39) IF llp S AvllS Ilp- 


Now if p is a positive integer, then fy is a uniform limit of linear combination 


Dd Cava: 


a>0 
Since ¢.(x) for a = 0 is orthogonal to g(x) = 1, we hence conclude that 
M.(f%) = 0, and consequently that 
(40) M.(f*) = a0. 
We now assume that p is an even integer 2k (k = 1, 2,---). The real part 
of (40) is 
; 2k 2k 2k nai 
M.Aj*) - (F) wag" fh) + ( 7 ) ana ft) — = (-1)fabt. 


Since, for 0 < r < s, the Holder inequality 
|M.GT™) | S (UF lle)” (UF lle)” 
holds, and | ao" | < (|| f ||2x)”, we readily conclude that the quotient 


Y = || lle = | F lle 


does not exceed the largest root of the equation 


y* ~ (7) y>* me! (7) yo bia te 4 ait 2 Es 0, 


and this proves (34) for p = 2k. 
Putting 


K(a, y) = —1 > Val Gar) Ga(y) — ~—a(t)~—aly)) 
where ye = 1 if da ¥ 0, and y2 = O if a, = 0 we have 


f(x) = M,(K(a, y)f(y)). 


Relation (39) can now be easily completed for 2k < p S 2(k + 1) by a funda- 
mental inequality which is also due to M. Riesz.” 
In case 1 < p < 2 we consider any real finite sum 


g(x) _ bo go + x (Daa + bae—a) 





*1 See Zygmund [18], p. 192. 
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and the corresponding function g(x). Obviously 
| M.(gf) | = | Me(Gf) | S |G lle IIS Ilo S Aa lig lle IlF lle. 


Since, by assumption, {ga} is closed in R,, the relation 
| Mz(9f) | S Aallg lle IIS Il 


will hold for any ge R,. The statement of our theorem follows now if we put 
g(a) = sign f(x)-| f(z) P-. 

By a similar argument we now obtain 

TazorEM 17. If {ga(x)} is symmetric and closed in V,, p > 1, and for 
1 <p < 2 also closed in V,, q = p/p — 1, then corresponding to any function 


F(E) id > AaYa 


in V, there exists a function 
F *(E) oe Z da Pa 


a>0 


which again belongs to V,, and || F* ||, < A,|| F'||,. 


20. Special cases 


tax 


If C is the module of Bohr functions and ¢,(x) = e’”’, then the Fourier series 
(41) Lo dae" 
a>0 
of any function F(Z) « V,, p > 1 can be “bisected” within the space V, into 
the parts }vaso and Dve<o. If 8 is any fixed real number, then )>. ase” ”” 
is again a Fourier series, the corresponding function being 


G(E) = / e-®* dF (E). 


Hence the Fourier series (41) can be bisected at any point 8, thus giving rise 
to the Fourier series }va>g and > a<s. Also, for any numbers £ and 7, 


|| p> ae" lp = A>|| z dae" | |p. 
Bsasy a 


In the case of almost periodic Fourier series of two variables 
(42) z Cas ei (artby) 

ap 
belonging to V,, we conclude that they can be cut along any straight line 
pa + o8 = 0 and, by translation of the origin, along any line pa + o8 = + 
with arbitrary p, o, 7. If in the original series (42) only integers a, 8 occur 
then the two parts of the series will also contain only integer exponents and 
hence belong to the Lebesgue class L, on the torus 0 S x < 27,0 S y < 2z. 
Hence we conclude that the Fourier series 

2. Sl aia 





























ards acetic snes Nasi 
RA. eNO BET 


LS Vr STIS. se 








794 S. BOCHNER 


of functions f(z, y) belonging to L,, p > 1, can be bisected along any line 
pm + on = 7 in the (m, n)-plane and not only along the lines passing through 
the origin. 


V. Functions oF BOUNDED VARIATION IN —®© <2 < 


21. Besicovitch functions 


Throughout the present section C will denote the module of Bohr functions 
in —«°© < 2x < ©, and all set functions will be defined on the field § which is 
generated by this module. Therefore, if f « C, its mean value M,f(z) is given by 


lim sn | fat (T + «) 
and if 
(43) > ane 


is the Fourier series of f(x) then 


T 
(44) ad, = lim— | e f(t) dt, —~x <a< om, 
T 
According to our general definition we have || f ||, = (Mz | f(x) |”)"”. Any 
e quence of elements {f,} from C for which 


(45) lim || fm — fn || = 0 (m,n — ~), 


determines a limit element belonging to V, (p > 1) or AC respectively, and the 
limit element has again a Fourier series of the form (43). According to our 
general theory the limit element is a set function F(Z). However in the special 
case we are dealing with the theory of Besicovitch enables us to “represent” our 
element F(Z) by a point function f(x). In fact if the elements f,(x) from C 
satisfy relation (45) and if (43) denotes the term-by-term limit of the Fourier 
series of f,(x), then there exists a function f(z) which is Lebesgue integrable 
over any finite interval and for which (44) holds for all real a. Also 


\| P(E) |lp = (iim 5 f: if) Pat) 


We now start, conversely, from an “arbitrary” function f(x) which is in- 
tegrable over any finite interval and for which the limit (44) exists for every 
real a; or more generally, from an arbitrary function F(x) which is of bounded 
variation over any finite interval and for which the limit 


T 


- 1 
46 a 
(46) de = lim oT 1. 


exists for all real a; and we set up the series (43) with these coefficients. The 
series (43) is not a Fourier series by definition. However, under fairly general 


e **' dF(t) 
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conditions there exists a set function F(Z) whose Fourier series coincides 
with (43). 

TazorEM 18. In order that (43) is the Fourier series of a function F(E) 
belonging to Vy tt is necessary and sufficient that for every exponential polynomial 


(47) g(x) = > bac’ 


the relation 





(48) lim oF be di g(t) dF (t) E M. “sup | g(x) | 


holds. In particular it is sufficient that 


(49) lim so |. "dF | < M. 


In order that F(E) belongs to AC it is sufficient that, moreover, the function 


(50) e(h) = sup 


lo|s1 





lim sn [oD ale +») ~ FO) 


is almost periodicin —-2x7 <h< «. In particular it is sufficient that, moreover, 
the function 


() ex(h) = lim of [dle +) — FCT) 


is continuous at h = 0 that is limp eo(h) = 
In order that F(E) belongs to V, it ts necessary and sufficient that 


(52) lim sn [aro 


<M.-\|g|l., q = p/p—1. 








In particular, if F(x) = / f(x) dx it is sufficient that 


oF Pdt < M” 
lim =o |. “14 | dt = M’. 


Proor. We remarked in §14 that (43) is the Fourier series of a function 
F(E) ¢ V; if and only if for any exponential polynomial (47) the relation 


(53) | >> b.aa| S M-sup | g(z) | 
holds. But, on account of (46), 
> Ea. = tim sr [. g(t) dF(t), 


and therefore (48) is equivalent with (53). 
The numerical function e(h) is the “translation function” of the abstract 
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function F(Z + h) whose values are elements of V~. The latter function is 
almost periodic if and only if e(h) is almost periodic. Therefore, by theorem 15, 
F(E) belongs to AC if e(h) is almost periodic. e(h) obviously has the properties 


(54) e(h) 20, e0)=0, e(hi + he) S e(hr) + e(In). 


Furthermore, e(h) S eo(h). Therefore, if eo(h) is continuous at the origin, then 
so is e(h). But a function e(h) with the properties (54) which is continuous 
at the origin is almost periodic,” and this proves our statement involving the 
function e(h). 

Finally the last part of the theorem is a consequence of the fact that a func- 
tion F(E) belongs to V, if it represents a functional on V,. 


22. A counter example 


By the theory of Besicovitch all functions F(Z) which belong to AC can be 
represented by point functions. We are going to show that this is no longer 
true for all functions which belong to V,. If the Fourier coefficient a, is 
defined by (46) or by some similar formula it is Lebesgue measurable as a 
function of a. However we shall exhibit a function F(Z) whose Fourier coeffi- 
cient is not so measurable. 

Let {&-} be a Hamel basis for real numbers. Any real number a can be 
represented uniquely in the form >: p-é, with rational coefficients pr of which 
only a finite number are ~ 0. If (47) is an exponential polynomial we may 
(and we shall) assume that the index a ranges over a lattice 

k 
Me gr 
a= > Nr N 
where JN is a suitable positive integer and the symbols n,, --- , nm, range over 
all real integers. We now take an arbitrary function y, = A(é,) > 0 and we put 











k 
A sii i | n | 
(a) > [me | N | 
and | 
(55) a, = ¢ 4, 


For ¢t > 0 the series 


co) 


> exp {—|n|¢+ inz} , 


n=—ooO 


converges absolutely and uniformly in —« < x < © towards a positive sum, 
and therefore the expression 


P(z) = , iw e 4@) ,fow 


af > exp {—|ne|| + in, Eh 


Nr 
r=1 n-=—o N N 
defines a positive almost periodic function. It is easy to verify that M,P(x) = 1. 








22 See Bochner [8], p. 136-7. 
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Now 


> bade we M. (d eg) g(x)) 


and therefore 
p> bade | < M.P(z).sup | g(x) | = sup | g(z) |, 


and hence we conclude that the expression (55) is the Fourier coefficient of a 
function F(Z) belonging to V:. However it is possible to choose the function 
n» = A(é-) in such a way that the resulting function A(a) is not Lebesgue 
measurable in aw.” Q.E.D. 


23. Linearly independent exponents 


Finally we shall briefly consider the case in which the exponents a@ for which 
the coefficient a, is different from zero are linearly independent; in which case 
the series (48) can be written in the form 


(56) Dd a(é,) 


the system {£,} being an appropriate Hamel basis. Putting g(¢, 2) = e“* and 
assuming that m S me--- S nm, fi S f2 +--+ S fn, then the integral 


(57) M.o(m, x) +++ (nm , 2)-p(f1, Z) -++ O(Sn, 2) 


has the value 1 if m = n and n, = {-, r = 1, ---, m, and the value 0 in all 
other cases. Consequently, any finite sum 





fle) =  al,ol&,, 2) 


r=1 


satisfies fork = 1, 2,3, --- the inequality 
(58) IF Ile SD | FIle. 


In fact,” if we substitute the values of (57) we obtain 


Me sta) * =D (Sto Fm acy (| al) I") 


M!++* Ns 





the sum ranging over m, = 0, --- , nm, 2 0,m+--- +n,=k. Therefore 


M. | f(z) |" s ki (= | a(é;) ') = kts ll. 


Combining (58) with Holder’s inequality we obtain 
WS lle SUFI. WF SF IN Us lle” 


—_ 


*® See Sierpinski [11]. 
* Compare Kaczmarz-Steinhaus [8], p. 131. 
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and thus 
(59) fle $4 /|F|la- 


If (56) is the Fourier series of F(Z) we choose a finite number of elements 
£,, +--+, & of the Hamel basis and we form the kernel 


K, (2) = af > (1 es ay givtrs 


r=1 y=—n 


and with this kernel the point function 


f(x) = [ K,(2 — t) d,F(E). 


Since K,(x) 2 Oand M,K,(xz) = 1, it is easily seen that 


WF lh S|] F lh. 


On the other hand the Fourier series of f(x) is 


8 


= 1 a > a(é,) e** 


n r=] 





and therefore by (59), 
wo Xf 28 
(*=4) Sloe) fs 161 F It. 


The integer n being arbitrary we finally obtain 


> | a(é) ? < 16 || F|l?. 


In particular we may draw the following conclusion. 

THEOREM 19. If F(x) is of bounded variation on every finite interval, if (49) 
holds, if the limit (46) holds for all real a, and if the indices « for which a, ¥ 0 
are linearly independent, then the set of these indices is countable, and the series 


DX | ae ? 
a 
1s convergent. 


PRINCETON UNIVERSITY. 
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ON A NECESSARY CONDITION FOR THE STRONG LAW OF LARGE 
NUMBERS 


By Paut R. Hatmos 
(Received April 3, 1939) 


Let 2, t2, --- be a sequence of independent chance variables. Define the 
chance variable Z, to be x, whenever | z, | S n, and to be zero otherwise (for 
n=1,2,---). Writeb, = E[(z, — E(%,))’].’ We shall suppose that 


n—>00 n 





The following theorem concerning these chance variables is an easy consequence 
of a result due to Kolmogoroff.” 
THEeoREM I. The two conditions 


° Tn _ = 3 
(2) P( lim das o} 1 
and 
°o Dn 
8) aa°" 


imply that 


(4) P) im a + — tm, 0 = 1, 


no 





n 


Proor: Since > ? os < «, the immediate application of Kolmogoroff’s 
result tells us that 


(5) P\ tim %, — ECG) + +++ + &, — EG) _ o} ms. 


n—>0o n 





Conditions (1) and (5) imply that the sequence {Z,} satisfies (4). It remains to 
prove that this in turn implies that the sequence {z,} satisfies (4). This can 
be done by proving that the sequences {Z,} and {2,} are equivalent in the sense 





1 E(x) is the expectation of the chance variable z. 
2C. R. Acad. Sci., Paris. i191(1930) pp. 910-911. 
3 If pis a proposition, {p}is the event “pis true’ and P{p} is the probability of {p}. 
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of Khintchine.* In other words we are to prove that 


(6) >» P{an ¥ En} < &. 

But Piz, ~ En} = P{|z,| > n}. Hence, if (6) is not satisfied, by the Borel- 

Cantelli lemma’ the probability is one that an infinite number of the conditions 

|z,/n| > lare satisfied. Since this is impossible, by (2), the theorem is proved.° 
The converse of this theorem is not known. It is the purpose of this note to 

make a contribution in this direction, by proving the following theorem. 





TaroreEM IT. Pt + ot + oh = 1, then ito} = land 





(7) »» 1 < © for alle > 0. 

The first part of the theorem is an elementary property of (C, 1) convergence. 
It is necessary to prove (7) only. It is clear, of course, that (7) does not imply 
(3). According to (7) the series in (3) may diverge, but it may not diverge too 
rapidy. In most cases it is not difficult to verify (7). 

For the purposes of the proof we assume that the chance variables z, are 
represented as measurable functions on a space Q, so that probability corresponds 
to a probability measure, P, defined on a Borel field of subsets of 2, and the 


expectation of the chance variable x corresponds to the integral | z dP." 


Proor OF THEOREM II. The measure of the set of points of 2 where (#; + - - - 
i,)/n — 0 is easily derived from Kolmogoroff’s triple limit condition for con- 
vergence almost everywhere.” We obtain 


(8) pat see \ s lim lim tim PA) | Sa ---, 


70 no N-0 








Ent -:: +n 
N <1} 





We consider, for the time being, » > 0, n and N as fixed. Fork = n,---,N 
let F; be the set where 


(9) 




















‘Fréchet, M. Généralités sur les probabilités. Variables aléatoires. Paris, 1937. p. 251. 

‘Ibid. p. 27. 

* Actually Kolmogoroff’s result can be stated slightly more generally, but it is here more 
convenient to state theorem I in its present form in order better to exhibit its relation to 
theorem IT. 

"For a discussion of the representation of sequences of chance variables see J. L. Doob, 
Stochastic processes with an integral-valued parameter. Trans. of the A. M. 8. 44(1938) 
pp. 87-95. See also P. R. Halmos, Jnvariants of certain stochastic transformations: the 
mathematical theory of gambling systems. Duke Mathematical Journal. 5(1939) pp. 461-478. 
* Math. Annalen. 99(1928) p. 315. 
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Then F, D Fry D --- D Fv D Fy, and if we write Z, = CF,, E,,, = 
F,-CFau, eee , Ew = FysCFy iy then we have 


(10) P(Fs) + IP) = 


We define, fork = n,---,N 


: 
1 
\ af (11) a. = [Gat sani + z)dPr, = PU, ) 4 a, aP, 


where a =  — / #dP and op = 2n + --- +2. (We shall also write s, = 
En + --- + & fork = n,---,Nj on+1 = Sn+ = 0). Then, on the set F,, 


(k =n,---,N), 
Sk — [ sap — pata 


—— 








lo. — %| = 


| sk | + = 














bias r aaP| 





i | ) (12) = 





UMS Ss kn + = -kn - P(Fy) = 2k. 


Bi, ,) 


If we set @n_1 = 0, wehavefork = n,---,N 


| 
| a, — Gail = || oxdPr,— [usdPr, 














hit ) Loni 
1) : <= | fa dP», +| | s.aPr, — | s.aP ~ [ wadPr,., +] s.dP 
: if ' i (13) s 2k + spe +] fsdPr 
= 2k + evr ore / a4 aP 
pr) | Jrp | P(Fy-) | Jrp-1 





S 2k + (k — 1)n + (k — 1)n S 2k(1 +»). 
Similarly, fork = n, --- , N, we have, (setting F,_; = 0), 


i (ox — as)* dP =| + § (ox — ay)? dP 


Fe I (ox — a4)’ dP + [(ox1 — Gea) — (ae — ea) + zx) dP 
(14) es a 
</ (o. — a)’ dP+ / [2(k — 1)y + 2k(1 + 9) + 2k] dP” 
Fx Ex 


< i (ox — ax)? dP + P(E,) (4k(1 + n))’. 





° CE is the complement in 2 of the set E. 
10 We note that since | %, | S k, | 2, | S 2k. 
11 Since Ey = Fy1- ‘CF. GC Fi-. 
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dP 
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Finally, for k = eed ,N, 


PR-1 


[ (ox — ax) dP = [(ox—1 -- Qk) — (ax — axa) + x]? dP 


= ((on1 — Axa)” + (ax — axa)® + zi] dP 


FR-1 


IV 


/ (ox-1 — ae)” dP + P(Fy-) / z. dP” 
FrR-1 


=f a—as)'aP + Pls) f ar. 


PR-1 


Combining the inequalities (14) and (15) we obtain 


| [ (x1 — a4-1)?dP + P(Fr) / 2dP 
(16) k-1 


< (ox — ax)’ dP + P(E,) -16k*(1 + 7)’. 


Fk 
Now let e be any positive number. Dividing (16) through by k’** we obtain 


1 
fete - 


(17) 


rs sacs a1)” dP + P(Fy) on zi, dP 


S gan |, (oe — a*aP + PCE)-16(1 + 2% 
Hence, summing (17) for k = n, --- , N, 


> Bi / (6-1 — dea)’ dP + P(Fy) Ez 2 ier 


k=n k2te FR-1 


(18) 





[ (ox — a)" dP + 16(1 + 9)" D PCE).” 


i k=n ite F 


If we write a, = / (ox — ax) dP, then we have (if « < 1) 
Fk 





n 


N—1 1 1 i 
PO) Epes Eo (pe — ace ape) + 


+ 16(1 + 7)’ > P(E;) 


othe 


" Since z; is independent of Fx . 


¥b, = [ sar. 
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N-1 Q+e Q+e 1+e 
na k 2 k werd 
< Da ( a ) + P(Fy) 
k=n 
1 N 
om + 16(1 + n)* 2) P(E.) 
N-1 l+e N 
53 Dew px, + 1601 +9)*| PRs) + 2 P| 
k=n kA ee k=n 
N-1 1 N-1 1 
= 3 Do on ez, + 16(1 + 0) $3 De + 1601 + 9)” 
I 
s K, ges 
where K is a constant independent of n, NV, or 7 (as long as we take, as we may 
without any loss of generality, 7 to be bounded). This last inequality concludes ger 
the proof of (7). For if the series in (7) diverges for any e, then by choosing NV Da 
sufficiently large we can dominate P(F) by arbitrarily small numbers. But this we 
contradicts the assumption that lim,.o lim,,, limy,, P(Fv) = 1. Hence ! 
theorem II is proved. - 
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UBER EINE VERALLGEMEINERUNG DER STETIGEN 
FASTPERIODISCHEN FUNKTIONEN VON H. BOHR 


von B. LewiTan 
(Received January 19, 1939) 


In vorliegender Arbeit wird eine verallgemeinerung der Bohrschen Funktionen 
gegeben, in derem Gebiete der Eindentigkeitssatz bestehen bleibt.’ 

Die Arbeit zerfallt in zwei Teile. Im ersten Teile untersuche ich die verall- 
gemeinerten fastperiodischen Funktionen und beweise den Eidentigkeitssatz. 
Dabei spielen eine grosse Rolle fastperiodische Mengen ganzer Zahlen, von 
welchen schon Besicovitch oft Gebrauch gemacht hat. 

Im zweiten Teile beweise ich, dass beschrankte Lésungen linearer Differential- 
gleichungen mit fastperiodischen Koeffizienten unter geniigend allgemeinen 
Bedingungen Funktionen sind, welche in unserem Sinne die verallgemeinerte 
Fastperiodizitaét besitzen. 


Kaprret 1. User pen EINDEUTIGKEITSSATZ 


i. Es sei Z eine relativ dichte Menge ganzer Zahlen {7;} (¢ = 0, +1, +2, --- ; 
rt; = —7;). Wir werden sagen, dass E bis auf 7 (>0) fastperiodisch ist, falls 
man, aus E eine solche relativ dichte Untermenge E’ auswahlen und eine solche 
Zahl I) > 0 angeben kann, dass fiir jedes Interval (a, b) welches linger als Jp ist, 
die Punkte der Menge E-(a, b) nach der Verschiebung um jede Zahl +r C E’ 
in Punkte von # tibergehen, mit Ausnahme héchstens von (b — a) aus diesen 
Punkten. 

Wir werden einfach sagen, dass E fastperiodisch ist, wenn E fastperiodisch 
ist bis auf jedes » > 0. 

Es sei f(x) fastperiodisch im Sinne von H. Bohr und sei ferner E, die Menge 
aller ganzer ¢-Verschiebungen der Funktion f(z). Dann ist, wie bekannt,” 
E, fastperiodisch fiir fast alle e. 

Satz I. Der Durchschnitt von zwei fastperiodischen Mengen E und E® ist 
auch eine fastperiodische Menge. 

BrweIs. Es sei E eine fastperiodische Menge von ganzen Zahlen 

E = {7;} (¢ = 0, +1, +2, --- 57s = — 74). 

' Vorliufige Mitteilungen habe ich ohne Beweis in den C, R. der Akademie des U. 8. 8. R. 
publiziert (Bd. XVII N 6, p. 287-290, Bd. XIX, N 6-7, p. 447-450) Ausfiihrliche aber auf 
andere Definitionen stiitzende Darstellung habe ich zuerst in den Communications de 
Sciences Math. de Kharkoff serie 4, t. XV2, p. 3-35 (1938) gegeben. Neue Definitionen, 
welche ich hier benutze, erlauben die Darstellung wesentlich kiirzer zu machen. 

*S. Besicovitch, Almost periodic functions, Cambridge, 1932, p. 55-59. 
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Dann ist die Funktion 


fiir nstsnunt+é (0 <6 <1), 


1 
K;(t) = 
@) 0 fiir iibrige ¢ 


fastperiodisch in Sinne von H. Weyl.’ 

Fiir jedes 7 > 0 gibt es, in der Tat, eine Zahl Z > 0 und eine relativ dichte § 
Untermenge E’ C E sodass fiir jedes c die Punkte von E-(c, ¢ + L) nach der 
Verschiebung um jede Zahl aus E’ wieder in Punkte von E iibergehen mit Aus- 
nahme von héchstens nL unter den verschobenen Punkten. Folglich ist fiir 
jedes 7 C FE’ 


1 e+L 
= [| Kilt + 2) — Kilt) | dt $ 205 < 2m, 


was die Fastperiodizitat von K,(t) im Weylschen Sinne beweist. 

Umgekehrt, ist K;(t) fastperiodisch im Sinne von H. Weyl und bilden fir 
jedes » die 7-Verschiebungen von K;,(t) eine Untermenge von £, so ist die 
Menge E fastperiodisch. 

Es mégen jetzt K$” () und K}” (t) den beiden Mengen E" und E® entsprechen. 
Der Menge E”.E” entspricht dann offenbar die Funktion K,(t) = K{”(t)- 
K$?(é) und diese Funktion ist im Sinne von H. Weyl fastperiodisch, weil die 
beiden Faktoren solche Funktionen sind. Die Verschiebungen von K;,(t) 
sind die gemeinsamen Verschiebungen von K$"(t) und Kf(t) und bilden 
folglich eine Untermenge von E®.E®. Daher ist E®-E® relativ dicht und 
fastperiodisch. 

DEFINITION. Eine in — ~ <2 < & sstetig erkldrte Funktion f(x) heisst 
N-fastperiodisch, wenn fiir beliebige « > 0 und N > 0 eine solche fastpertodische 


Menge ganzer Zahlen 7,(e, N) (n = 0, 41, +2, --- 3; 7» = — Tn) angegeben 
werden kann, welche den Ungleichungen 
[f(a t+ rm) —f(z)|<«6 |rt|<N (n=0,+1, +2,---) 


geniigen. 

Nach einem bekannten Satze," ist jede Funktion von H. Bohr N-fastperiodisch. 
Aber schon einfache Operationen mit Bohrschen Funktionen fiihren auf Funk- 
tionen, welche im Sinne von H. Bohr nicht mehr fastperiodisch sind, wahrend 
sie N-fastperiodisch bleiben. 

Es sei, z.B., f(x) eine Funktion von H. Bohr, fiir welche 


f(z) > 0, inf, f(z) = 0; 
(man kénnte 
f(x) = 2 + cos Az + cos Ax 


mit irrationalem );/A: nehmen). 





3S. Besicovitch, p. 92. 
‘ Besicovitch, p. 55-59. 
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Die Funktion 


a ad 
oz) = 7) 


ist im Sinne von H. Bohr nicht fastperiodisch, weil sie nicht beschrinkt ist. 
Aber, N-fastperiodisch ist p(x) gewiss, denn bedeutet 7 eine e-Verschiebung von 
f(z) und ist fiir | x | < N f(x) > x, so besteht die Ungleichung 


| f(z + 7) — f(z) | € 


Ter eae, <ae-s <* 





2 


fir |2| < N und e < min (5, 52). 


Und da fiir f(x) die Menge aller ganzer e-Verschiebungen fastperiodisch fiir fast 
alle e war so ist p(x) wirklich eine N-fastperiodische Funktion. 

Sarz II. Summe und Produkt von zwei N-fastperiodischen Funktionen ist 
wieder eine N-fastperiodische Funktion. 

BeweEis. In bezug auf die Summe folgt die Behauptung unmittelbar aus 
dem Satze I. Um den Satz in bezug auf das Produkt zu beweisen, geniigt es zu 
zeigen, dass f'(x) N-fastperiodisch ist, wenn f(x) N-fastperiodisch war. 

Zum beweise bemerkte man, dass wenn fiir |z| < N 


lf@+r) — flz)|< « 
gilt, 
f(z +7) | < |f(@t+r7) —f(e)|+1f@)|<et+on (|| <Q). 
Somit haben wir 


[fle + r)P — [f(a)P | = | f(a + 7) — f(z) | |S + 7) +f@) | 


6 
” 2gn +1’ 
Insbesondere ist das Produkt einer N-fastperiodischen Funktion und einer 
Bohrschen Funktion eine N-fastperiodische Funktion. 

Da das Produkt einer N-fastperiodischen Funktion mit einer Konstante 
wieder N-fastperiodisch ist so folgt aus dem Satze II, dass die Klasse der N- 
fastperiodischen Funktionen linear ist. 

Insbensondere ist die Differenz zweier N-fastperiodischen Funktionen auch 
N-fastperiodisch. 


Se(Qgvn +e) <6, falls’ ¢ la| <QN. 


2. Es sei f(z) eine in jedem endlichen Intervalle messbare Funktion, und es sei 


(1) fim J, [ 1s) Par < =. 


T-2o 
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Setzt man noch voraus fiir alle reelle \ die Existenz der Mittelwerte 


T 
—ihz 
a(A) = lim oF aT 7 | fae dz, 


so entspricht bekanntlich jeder Funktion f(x) eine Fouriersche Reihe 


f(z) ~ Do Ane 


wobei 


T 
—tAng 
A. = lim =F [ s@ dz. 

Die Zahlen A, bezeichnen wir als Fouriersche Koeffizienten der Funktion 
f(z), und A, als Fouriersche Exponenten der Funktion f(z). 

Es sei nun f(x) eine N-fastperiodische Funktion. Dann gilt folgender 
Fundamentalsatz: 

Satz III. Ist f(x) eine N-fastpertodische Funktion, die der Ungleichung (1) 
geniigt, und existiert fiir jeden reellen der Mittelwert a(d), so kann man fiir 
beliebige « > 0 und N > 0 ein trigonometrisches Polynom P(x) angeben, das die 
Ungleichung 


f(z) — P@@)|<«6 |a|<N 


befriedigt. 

Dabei sind die Exponenten des Polinoms P(x) Fouriersche Exponenten der 
Funktion f(x), wahrend die Koeffizienten von P(x) durch Multiplikation von 
Fourterschen Koeffizienten der Funktion f(x) mit gewissen Zahlen erhalten werden. 

Brewers. Auf N-fastperiodische Funktionen iibertragt sich leicht der Weyl- 
sche Beweis des Approximationssatzes der Theorie der fastperiodischen Funk- 
tionen. 

Es seien e > 0 und N > 0 beliebige Zahlen. Da f(x) N-fastperiodisch ist, 
so ist die Menge aller (e/3, N) Verschiebungen der Funktion f(x) fastperiodisch. 
Fir diese Menge bilde man die Funktion 


o|— 


fiir mnSss7r+86 (¢=0,+1, ---;7-1 = —71) 
K;(s) = 


0 fir tibrige s 


Wahlt man 6(<1) so dass | f(z + h) — f(x) | < ¢/3 fir |h| < d6und|z| <N, 
so hat man 





1 Tn t6 1 
n+ L,, f(x + s)K;(s) ds = mi Zz ; bat gy + s) ds 
1 > 


erie Dif sett ads. 
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Nach der Eigenschaft der Zahlen 7; und nach der Wahl von 6 hat man 


} [tet n+ 9d0—se))| <* (j2| <Q). 
Folglich 
(3) ey Dif fetntoa—se| <™ (|2| <Q). 
Aus (2) und (8) folgt: 
1 mts 2 
(4) ln +1 f(x + s)K;(s) ds — f(z) <3 (\z| <N). 


Nun beniitzen wir die Fastperiodizitét im Weylschen Sinne der Funktion 
K;(s)—. Da K;(s) beschrinkt ist, so gehért sie zur Klasse’ W*. Folglich kann 
man fiir jedes » > O solche Zahl 7) = To(n) und solche endliche trigonomet- 
rische Summe® 

gy 


>> ae? 


k=1 


aufstellen, dass 


2 
ds <7 


—idgs 
ne ge 











k=1 


1 T 
rl, 
fir 7’ > T> ist. 
Nach der Schwarzschen Ungleichung hat man: 
gil teto[ Ko - Lac] as} 
lon 1 ¥ f(lt+s ols) — 2, ane s 


c. 2rn 1 cite 2 | 
sta acd. | f(x + s) |’ ds 


sis Orn y os 
(2n+1 2, 
Nun zeigen wir die Existenz des Limes 


lim a lim ™ 
no 2n 4 1 n-?>cO n : 


Ve, 
or 
~~ 

lA 














I. 


g i 2 
K;(s) -— > axe ™| ds 








In der Tat, da K;(s), als eine im Weylschen Sinne fastperiodische Funktion, 
ein Mittelwert besitzt so ist: 





: ” _ 2n+1 
lim + K;(s) ds = lim + : 
n—>00 27. Tr no 2Tn 
* Besicovitch, p. 77. 
* Besicovitch, p. 91. 
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Bezeichnet man 


oh iin =e 
M= lim A “1 s@) ? dx, m = lim 


m-—co 7 


und nimmt man 
2 


€ 
9Mm? 
so kommt man nach Beriicksichtigung von 
1 tté g » Vv 3 
mer. ere (= mete *) ds = 21 bee = P(z) 


wo 


T 
—ihpe 
be = maui or | fe de 


und (4), (5) zur Beziehung 
\f(z)- P@)|<e (|| <WN) 


was zu beweisen war. 

Aus dem Satze III ergibt sich unmittelbar der Eindeutigkeitssatz. 

Satz IV. Geniigen zwei N-fastperiodischen Funktionen f(x) und g(x) der 
Ungleichung (I) und haben sie dieselbe Fouriersche Rethe, so sind sie identisch. 

BeweEts. Nach dem Satze II ist die Differenz h(x) = f(x) — g(x) eine N- 
fastperiodische Funktion; ihre Fouriersche Koefficienten sind alle gleich Null. 
Auf Grund des Approximationssatzes ist also h(x) = 0. 


3. Gehort eine N-fastperiodische Funktionen zur Klasse’ B’ so erfiillt sie die 
Ungleichung (J) und fiir sie existiert die Funktion a(a). 

Mann kann auch eine notwendige und hinreichende Bedingung dafiir angeben, 
dass fiir eine Funktion, die der Ungleichung (J) geniigt, das Limes 


ae 1 [ —idz 
a(v) = lim oF Ame dx 


existiere fiir alle reelle \, so wie auch eine notwendige und hinreichende Beding- 
ung fur gleichmassige Existenz der mittelwerte (fiir jedes \).8 

Andererseits, gemeinsam mit B. Levine, habe ich konstruiert ein Beispiel einer 
beschrinkten N-fastperiodischen Funktion f(x), fiir welche der Mittelwert 


hm oF ol, f(z) de 


nicht mehr existiert. 





7 Besicovitch, p. 77 und p. 100. 
® C. R. der Academie des U. 8. S. R., vol. XIX, N 6-7, p. 447-450, Com. des Sc. Math. de 
Kharkoff, serie 4, t. XV2 , p. 3-35 (1938). 
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Dieses Beispiel wird in den C. R. USSR veréffentlicht. 

Demgemiss erscheint es uns natiirlich, eine Verallgemeinerung des Mittel- 
wertbegriffes zu unternehmen. Am allgemeinesten lisst es sich durchfiihren, 
wenn man vom verallgemeinerten Limes von 8. Banach Gebrauch macht.’ 

Sei f(z) eine der Ungleichung (J) geniigende Funktion. Alsdann ist F(7) = 


- 
a [ f(z) dx eine fir alle T beschrinkte Funktion. Man Bezeichnemit Lim das 
T 


verallgemeinerte Limes von 8. Banach. Unter den Mittelwert M{f(x)} der 
Funktion f(x) wollen wir verstehen Limy.,.. F(T’). 

Aus den Eigenschaften des Verallgemeinerten Limes von 8. Banach ergibt 
sich, dass der soeben definierte Mittelwert volgende Bedingungen erfiillt: 
1) M{af + bg} = am{f} + bm{g}, (a b sind konst., f(x) und g(z) geniigen (J)). 
2) Mij(z)} = 0 wenn f(z) = 0. 
3) Mif(xz + 2)} = M{f(z)} (xo ist reell). 
4) M{1} = 1. 


5) Existiert limy - [. f(x) dx, so stimmt es mit M{ f(x)} iiberein. 
Auf tibliche Weise erhalt man alsdann die Schwarzsche Ungleichung: 
| M{f(a)g(a)} | S (M{| f(a) *}-M{| g@) IP 
und bildet die Fouriersche Reihe 
f(a) ~D Ane*™”, = An = M{f(z)e™*"}. 


Ks gelten fiir diese Fouriersche Reihen die Satze III und IV. Wollen wir den 
Beweis z.B. fiir den Satz III durchfiihren. 

Es sei auf der fastperiodischen Menge 7;(¢/3, N) (¢ = 0, +1, +2, ---) die 
Funktion K;(s) wie folgt definiert: 


fir7; Ss S 7+ 4, 


K;(s) — 5 
0 fiir iibrige s, 

wo 

r = lim — 

no Tn 
Offenbar ist 
rate BEY 2 ae 
M{K,(s)} = lim 5— f K;(s) ds = lim 5 a 1. 


Betrachten wir nun die Funktion 
g(x) = M. {f(x + s)K,(s)}, 


die existiert wegen (J) und der Schwarzschen Ungleichung. 
ES 


*S. Banach, Théorie des operations linéaires, p. 33. 
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Wir haben 
| f(z) — o(z) | S M.{| f@ + 8) — f(z) | Ki(s)}. 


Ist aber | z | < N, so ist in denjenigen Intervallen wo K;(s) ¥ 0 
2 
Ife +8) -f@|<F 


wenn nur 6 geniigend klein ist. 
Somit hat man 


Ise) — g(a) | < MAK) = =. 


Nun approximiere man im Mittel die Funktion K;(s) durch endliche trigono- 
metrische Summe und schliesse den Beweis des Satzes wie im Falle der Existenz 
der gewohnlichen Mittelwerte. 


4. Man kann eine andere Definition der N-fastperiodizitat zu Grunde legen. 

Man betrachte die im Intervalle — © <a < © stetige Funktionen f(z), 
die den folgenden zwei Bedingungen geniigen: 

I. Fir jedes e > O und N > 0 existiert eine relativ dichte Menge reeller Zahlen 
7 = t(e, N) sodass 


[f@ +7) —f@)|<e (j2|< N). 


II. r(e, N) + r(p, N) = 1(6 ;N), wobei 5 = 5(e, p) strebt gegen 0 mit « und p. 

Fir diese Funktionenklasse gilt der Satz II (Invarianz gegeniiber Addition 
und Multiplikation). 

Setzt man noch voraus, dass f(x) zur Klasse B’ gehort, so bleibt auch der 
Approximations—und der Eindeutigkeitssatz bestehen. 

Im allgemeinen, wenn man die Funktion 4(e, p) der Bedingung 


6(€, p) = e+ (p) 


unterwirft wo \,(p) mit p gegen 0 gleichmassig fir alle ¢ strebt, so lasst sich 
zeigen, dass fiir jede « > 0 und N > 0 eine fastperiodische Menge (e, )- 
Verschiebungen existiert, sodass also f(x) die in dieser Arbeit angenommene 
Bedingung erfiillt. 

Ganz ausfiihrlich behandelte ich alle diese Fragen anderswo.” 


Kapite, 2. User LInEARE DIFFERENTIALGLEICHUNGEN MIT 
FASTPERIODISCHEN KoEFFIZIENTEN 


1. In diesem Kapitel behandeln wir einige Eigenschaften beschrankter 
Lésungen der linearen Differentialgeleichungen mit fastperiodischen Koeff- 


1° Com. des. Se. Math. de Kharkoff, series 4, t. XV2 , p. 3-35 (1938). 
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rienten. Wir wollen zeigen, dass unter gewissen Umstiinden solche Lésungen 
1) 
N-fastperiodische Funktionen sind. 
Wir betrachten ein System linearer Differentialgleichungen 


(s:) * = fir(t)ar + fio()ae +--+ + fin(Oan + gilt) (i = 1,2, ---n), 


wo fix(t) a g:(t) sind stetig und reell fiir reelle ¢. 

Fir willkiirlich gegebene Anfangsbedingungen, die durch Angabe der Werte 
1(0, 22(0), --- , @n(O) festgesetzt sind, lisst sich (beispielsweise durch die 
Picardsche Methode der sukzessiven Approximation) eine Lésung finden, die 
diese Anfangswerte annimmt. 

Nun betrachten wir eine Folge von Systemen: 


(p) dx Vi 
(s; ) dt 
deren Koeffizienten konvergieren bzw. zu f;,,(¢) und g;(t) gleichmassig in jeden 
endlichen Intervall, wenn p — © und betrachten wir solche Folgen von Lésungen 
dieser Systeme, fiir welche Anfangwerte 2” (0) — 2;(0) fiir p > ©; dann kon- 
vergieren die Lésungen 2{°(t) gleichmassig in jedem endlichen baloney all zur 
Lésung des Systems (s,). Das ist eine Folgerung der Picardschen Methode. 

Wahlt man aus der Systemenfolge (s{”) zwei Systemen (s;”””) und (s}”?’) so dass 


SPO —$PO|<e Gk =1,2,---,n) 
gi’? t) - gi”? (t) | <e€ (¢ = 1, 2, mo shal n) 


woe > 0 willkiirlich ist, sofolgt aus der Picardschen Methode, dass wenn Anfands- 
werte der Lésungen 2{""’(t) und «{??)(t) jeder Systemen den Ungleichungen 


| af? (0) — 2x{?(0) | < a, (¢ = 1,2, --- , n) 


geniigen, so lasst sich eine Zahl w = w(a, €)(>0) und eine Zahl T(> 0) angeben, 
fiir welche 


(Oar + fie Ome + +s +f/2Orn +90 (@=1,2,---,n) 


(—-o <t<o), 


(1) | af? (t) — xf? (t) | < a, 
fiir 
(2) —-TstsT 


ist, und, umgekehrt, fiir jedes noch so kleine w > 0 und so grosse 7’, lassen sich 
solche Zahlen a und ¢ angeben, dass die Ungleichungen (i) und (2) bestehen. 
Nehmen wir nun an, dass im System (s,) alle Funktionen f;,x(¢) und g;(¢) 
fastperiodisch sind. Es heisse dann das System fastperiodisch. Da die 
Anzahl der Funktionen fix(t) und g,(t) endlich ist, so existiert fiir jedes e(>0) 
eine relativ dichte Menge {r(e)} der ¢-Verschiebungen, die allen Funktionen 





= Was die Methoden und Bezeichnungen anbetrifft, die in diesem Kapitel auftreten, 


vergl. man die Abhandlung von J. Favard, Sur les équations différentielles linéaire 4 coeffi 
cients presque périodiques, Acta Math. t. 51, p. 31-81 (1927). 
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gemeinsam sind. Die Tatsache, dass 7(e) «-Verschiebung fiir alle Funktionen 
4 fix(t) und g(t) ist, bezeichnen wir wie folgt 


| (Str) — (82) | < «, 


i i und es hiesse r(e) eine Verschiebung fiir das System (s,). 


' Mit E,(s) bezeichnen wir ferner die Menge aller ganzen Verschiebungen des 
i Systems (s;). Bekanntlich” ist E,(s) fastperiodische fiir fast alle «. 
. Gleichzeitig mit dem System (s;) wollen wir das homogene System 


| ih a | (2) a = fiat + +++ + finan (i = 1,2, +--+ ,n) 


betrachten. 
Satz I. Hat das inhomogene System (s;) eine beschraénkte, wdahrend das 
System (2.) keine beschrénkte Lésung (mit Ausnahme, selbst verstdndlich, der 
trivialen Lésung x; = 0), so besteht die beschrankte Lésung von (s:) aus N-fast- 
periodischen Funktionen. 
1) jose BewEIs. Sei «, €2, --- eine unendliche Folge unbeschrinkt abnehmender 
| Zahlen, fiir die die Mengen E, (s) fastperiodisch sind. Bezeichnet man die 


ae ‘i Zahlen der Menge &,,(s) mit ru (U = 1, 2, ---), (k = 0, +1, £2, ---), so gilt 
1h ee offenbar 
| 











(3) | (Seton) — (82) | < (i= 1,2,---) (& = 0, +1, +2, -::). 






Ist nun 2,(t) eine Lésung des Systems (s;) so ist offenbar 2,(t + 7) eine 
Lésung des Systems (s;4,,,). Wegen friiher gemachter Bemerkungen und 
(3) geniigt es fiir den Beweis des Satzes I zu zeigen, dass 

a) Die Zahlen x;(7,,) nur einen einzigen Grenzwert 2;(0) fiir unbeschrainkt 
wachende | und k haben; 

b) fiir jedes 7 der Grenzwelt x;(7,) fir 1 — © gleichmissig nach k existiert, 
d.h. es lasst sich fiir jedes a > 0 eine Zahl N(a) angeben, sodass fiir 1 > N(a) 
und fiir alle 7 und k 










| 2s(7) oie x;(0) | <a 







ist. 
Fixieren wir k und lassen wir | unbegrenzt wachsen, so gilt wegen (3) 





lim (St4+),) = (s:). 





Wenn wir annehmen, dass die Folge 2;(71) fiir irgendein 7 und fir fixierten / 
zwei Grenzwerte 2,(0) und 2;(0) aufweise, so wiirde das homogene System eine 









12 Besicovitch, p. 55-59. 
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beschrankte Lésung zulassen, was der Bedingung des Satzes widerspricht. 
Auf dieselbe Weise schliessen wir, dass fiir kein 7 die Grenzwerte 
lim ri(ru,); lim i (Tikg) (ki # ke) 
verschieden sein kénnen. Somit ist a) bewiesen. 
Zum Beweise von b) nehmen wir an, dass der Grenzwert 2,(r,) existiert 
ungleichmassig nach k. Dann lassen sich eine Zahl a > 0 und zwei unendliche 
Folgen angeben 


Lk, eee he eee 
| Se See 
fir welche bei gewissem 7 
(4) | x:(71,%,) — 2:(0)| > @ 
gilt. 
Aus der unendlichen Folge z;(r1,x,) lasst sich aber eine konvergente Unter- 
folge auswihlen, die etwa zu x;(0) konvergiere. Wegen (4) gilt 


|2;(0) — 2(0)|> a. 


Dann wird has homogene System (2,) eine beschrinkte und von 0 ver- 
schiedene Lésung zulassen, was der Bedingung des Satzes wiederspricht. Somit 
ist der Satz I vollstandig bewiesen. 


2. Zum Beweise des Satzes I benétigten wir der Eindeutigkeit der Grenzwerte 
z(tx). J. Favard™ aber hat gezeigt, dass sich diese Grenzwerte fiir gewisse 
Lésung 2z,(t) eindeutig bestimmen auch in dem Falle, wo das homogene System 
(2,) keine Lésung hat, die nach dem absolutem Betrag noch so klein werden 
kénnte (die Lésung x; = 0 ausgenommen). Dabei verstehen wir unter dem 
absoluten Betrag den Ausdruck 


(xi) + 23) + --- + co}. 
Wir haben also: 
Satz II. Hat das System (2,) keine lésung, derer absolute Betrag noch so klein 
werden kénnte (Lésung x; = 0 ausgenommen) wnd hat (s,) beschrankte Lésungen, 
80 besteht wenigstens eine dieser Lésungen aus N-fastperiodischen Funtionen. 


InstTITUT FUR MATHEMATIK, 
Kuarkorr, U.S. 8S. R. 





* J. Favard, |. ¢., pp. 60-61. 
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GRUNDZUGE EINER INHALTSLEHRE IM HILBERTSCHEN RAUME 


Von Kari LOwNER 
(Received February 21, 1939) 


1. Einleitung 


Bei dem iiblichen axiomatischen Aufbau der Inhaltslehre im endlichdimen- 
sionalen euklidischen Raume wird das Lebesguesche Mass yu (2{) einer messbaren 
Punktmenge Y% in seiner Abhangigkeit von Y als eine Mengenfunktion aufgefasst, 
welche folgenden Postulaten geniigt: 

1. Der Definitionsbereich von yu(2M) ist ein o-Kérper K, welcher mit jeder Menge 
auch alle ihr kongruenten enthalt.’ 

2. u(A) ist reell und nichtnegativ. 

3. Ist% = An, An < K(n = 1,2, ---), UM = O(k ¥ I) 80 ist 


w(%) = Do w(An) 


n 


4. Sind A und B kongruente Mengen aus K, so ist 


u(X) = u(B) 


5. Jeder Wiirfel ist messbar und hat einen endlichen positiven Inhalt. 

Die vorliegende Arbeit enthalt einen Versuch, eine entsprechende Theorie 
fir den Hilbertschen Raum auszubauen.* Man sieht sofort, dass das Axiomen- 
system nicht wortlich iibernommen werden kann. Zunidchst muss man im 4. 
Axiom die Wiirfel, welche im H.R.* nicht beschrinkte Punktmengen darstellen, 
durch hier einfachere Gebilde, am besten durch Kugeln ersetzen. Doch auch 
bei dieser Modifikation ist das Axiomensystem nicht erfiillbar. Das sieht man 
so ein: Aus den drei ersten Axiomen folgt bekanntlich, dass u(%) < u(B) ist, 
wenn % in B enthalten ist. Nun sei & eine Kugel vom Radius a > 0. Man 
denke sich nun ein cartesisches Achsenkreuz des H.R., dessen Mittelpunkt mit 





1 Kin System von Mengen heisst bekanntlich ein Kérper, wenn die Summen- und Differ- 
enzbildung aus ihm nicht herausfiihrt. Es heisst ein o-Kérper, wenn sogar die Summen- 
bildung mit abzihlbar vielen Summanden aus dem System nicht herausfiihrt. Der Aus- 
druck ‘“‘Summe’’ wird hier wie stets im Folgenden gleichbedeutend mit ‘‘Vereinigung” 
gebraucht.—Kongruent sollen zwei Punktmenge des Hilbertschen Raumes heissen, wenn 
sie durch eine isometrische Abbildung des vollen Raumes auf sich selbst ineinander tiberge- 
fihrt werden kénnen. 

? Die Anzahl der Summanden kann endlich oder abzaihlbar unendlich sein. 

* Kinen kurzen Bericht iiber die vorliegende Untersuchung hat der Verfasser auf dem 
Kongress der Mathematiker der slavischen Lander, Prag 1934, vorgetragen. Siehe ‘‘Zpravy 
o druhém sjezdu matematikai zemi slovanskfch, Praha 1935.”’ 

‘ “H.R.” ist Abkiirzung fir ‘‘Hilbertscher Raum.”’ 
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dem Mittelpunkt von & zusammenfallt. Um die Mittelpunkte der auf den 
positiven und negativen Achsen liegenden Radien von & schlage man je eine 
Kugel vom Radius b < a/(2./2) (b > 0). So erhalt man eine Folge von Kugeln 
,(n = 1, 2, ---) von gleichem Radius, welche paarweise punktfremd sind und 
alle in R liegen. Bezeichnet man den Inhalt einer Kugel vom Radius c mit 
I(c), so folgt aus dem dritten und vierten Postulat 


w(K: + Re + --- + Kn) = nI(b)(m = 1, 2, ---) 
Es ist also 
nI(b) S I(a) (n = 1, 2, ---) 


Wegen Giiltigkeit des Archimedischen Axioms im Bereich der positiven reellen 
Zahlen ist diese Ungleichung im Widerspruch zu dem modifizierten fiinften 
Postulat. 

Wir sehen, dass eine Abaénderung des Axiomensystems notwendig ist. Gleich- 
zeitig weist uns die eben durchgefiihrte Uberlegung einen naturgemiissen Weg 
hierzu: Man muss entweder den Bereich der nichtnegativen reellen Zahlen durch 
ein System von Gréssen ersetzen, welches dem Archimedischen Axiom nicht 
geniigt, oder man muss, wenn man den Bereich der reellen Zahlen nicht verlassen 
will, die Forderung der Nichtnegativitaét der Inhaltswerte aufgeben. Wir wer- 
den hier den ersten Weg einschlagen. 

Der vorliegende Versuch einer Inhaltslehre in einem Raum von unendlich 
vielen Dimensionen unterscheidet sich von den bisherigen wesentlich darin, 
dass diese alle den Bereich der Inhaltswerte dem System der nichtnegativen 
reellen Zahlen entnehmen. Der Weg, den wir einschlagen, ist kein rein axioma- 
tischer. Wir kombinieren die axiomatische Methode mit geometrischen Be- 
trachtungen, indem wir den Inhalt zunachst nur fiir gewisse Rotationskérper 
definieren, welche als “Verwandte” der Kugel hier eine ahnliche Rolle spielen 
wie die Polyeder im endlichdimensionalen Raum. Indem wir das Cavalierische 
Prinzip als heuristisches Prinzip benutzen, werden wir auf naturgemisse 
Begriffsbildungen gefiihrt. 


2. Geometrische Siatze iiber Rotations—und Rotativkérper 


Unter einem Rotationskérper § des H. R. soll im Folgenden ein Ké6rper 
verstanden werden, welcher alle Rotationen um einen endlichdimensionalen 
ebenen Unterraum a° des H. R. gestattet. Unter einer Rotation um a ist jede 
isometrische Abbildung des H. R. in sich zu verstehen, welche alle Punkte von a 
einzeln festlisst.° a nennen wir eine Achse von ®. 

Um spitere Betrachtungen nicht unterbrechen zu miissen, wollen wir hier 


’Zu den ebenen Raumen zahlen wir auch einzelne Punkte. Wir schreiben ihnen die 
Dimension Null zu. 

‘Wenn man diese Definition einer Rotation bei euklidischen Raumen endlicher Di- 
mensionen verwendet, muss man etwa eine Spiegelung an einer Ebene auch als Rotation 
bezeichnen, 
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zunachst eine Reihe von geometrischen Hilfssitzen tiber Rotationskérper zu- 
sammenstellen. 

1. Hitrssatz: Ist a, eine k-dimensionale Achse des Rotationskorpers §, so ist 
auch jeder endlichdimensionale ebene Raum a, von einer Dimension | > k, welcher 
a, enthdlt, ebenfalls Achse von §. 

Dieser Satz folgt unmittelbar aus der Definition einer Rotationsachse. 

2. Hitrssatz: Die Rotationskérper bilden einer Mengenkérper. Brwets: Der 
Rotationskérper § habe die Achse a, der Rotationskérper © die Achse 6. 
Man bilde den Verbindungsraum c von a und 8, d. h. den ebenen Raum kleinster 
Dimension, welcher sowohl a als 6 enthalt. Seine Dimension ist bekanntlich 
nicht grésser als die Summe der Dimensionen von a und 6 vermehrt um eins. 
Nach dem 1. Hilfssatz ist ce Achse sowohl von ® als auch von ©. Hieraus folgt 
offenbar, dass auch # + S und wenn R < © ist, auch S — MR simtliche Rota- 
tionen um c gestattet. 

3. Hitrssatz: Sind a und b Achsen des Rotationskérpers R, so ist auch der 
Verbindungsraum von a und 6 Achse von R. 

Der Satz ist eine unmittelbare Folge des 1. Hilfssatzes. 

4. Hiurssatz: Sind a und 6 zwei Achsen des Rotationskérpers R und haben 
a und b einen nichtleeren Durchschnitt d, dann ist auch dieser Achse von R. 

Wir beweisen die etwas schdrfere Aussage: 

4’. Hiurssatz: Jede Rotation wm den nichtleeren Durchschnitt d zweier ebener 
Rdume endlicher Dimension a und 6, ldésst sich aus endlich vielen Rotationen um 
a und b zusammenseizen. 

Wir schliessen durch vollstandige Induktion nach dem Wert von s = p + q 
worin p und qg die Dimensionen von a und 6b bedeuten. Da man annehmen 
kann, dass keiner der beiden Riume a und 6 in dem anderen ganz enthalten ist, 
ist der kleinstmégliche Wert von s gleich 2 und wird fiir p = q = 1 geliefert. 
a und 6 sind dann zwei sich in einem Punkt D schneidende Geraden. Hier 
kann man folgendermassen schliessen. Durch eine Rotation B um 6 entstehe 
aus a die Gerade a* ~ a. Sie geht ebenfalls durch D hindurch. Jede Rotation 
A* um a* lasst sich vermége einer zugehérigen Rotation A um a in der Form 
A* = B ‘AB darstellen. Wenn also der zu beweisende Satz fiir das Achsenpaar 
a, a* richtig ist, so auch fiir das Paar a, 6. Sei nun 6 der zwischen 0 und 37 
gelegene Winkel zwischen a und 6b. Der Winkel zwischen a und a* variiert mit 
a* zwischen 0 und 26. Ist 26 = 32, so kann man a* so wahlen, dass der Winkel 
zwischen a und a* gleich 32 ist. Ist aber 25 < 32, so wahle man a* so, dass der 
Winkel zwischen a und a* den maximalen Wert 26 annimmt und verfahre dann 
mit a und a* ebenso wie vorher mit a und 6. Nach endlich vielen Schritten 
dieser Art gelangt man schliesslich zu einem Achsenpaar a’, 6’ durch D, welche 
aufeinander senkrecht stehen und die wesentliche Eigenschaft haben, dass jede 
Rotation um a’ bzw. 6’ sich aus endlich vielen Rotationen um a und 6 zusam- 
mensetzen lisst. Ebenso wie im dreidimensionalen euklidischen Raume be- 
weist man aber, dass jede Rotation um den Schnittpunkt zweier einander 
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senkrecht schneidender Geraden a’, 6’ aus héchstens drei Rotationen um a’ 
und b’ zusammengesetzt werden kann. 

Sei nun p + q > 2. Ist die Dimension d des Durchschnitts } von a und b 
positiv, so ist jede Rotation um } (und um so mehr jede Rotation um a bzw. 5) 
vollkommen bestimmt durch ihr Verhalten in einem vollstandigen Orthogonal- 
raum’ © zu. Die Schnitte von a, 6, ) mit O haben der Reihe nach die Di- 
mensionen p’ = p — d, q’ = gq — d,d’' = 0. Die Summe 8’ = p’ + q’ = 
p+q — 2d ist kleiner als s = p+ gq. Man kann also vollstiindige Induktion 
anwenden. 

Sei jetatd = 0,p > 0,g>0,p+q> 2. Der Durchschnitt ist ein einzelner 
Punkt D. Ohne Beschrankung der Allgemeinheit kann angenommen werden, 
dass p > 1 ist. Man verstehe unter a’ irgend einen durch D gehenden Un- 
terraum von a von der Dimension p — 1. Durch eine passende Rotation um 6 
kann a in eine Lage a* gebracht werden derart, dass a und a* genau a’ zum 
Durchschnitt haben. Die Dimensionen von a und a* sind beide gleich p und 
die ihrer Schnitte mit einem vollstandigen Orthogonalraum zu a’ gleich 1. Da 
die Summe der letzteren gleich 2 ist, kann nach den friiheren Uberlegungen 
behauptet werden, dass jede Rotation um a’ aus Rotationen um a und a* 
zusammengesetzt werden kann. Ebenso wie im Falle p = q = 1 ist jede 
Rotation um a* durch Rotationen um a und b ausdriickbar. Im Ganzen kann 
man jede Rotation um a’ aus Rotationen um a und 6 zusammensetzen. Nun 
betrachte man das Achsenpaar a’, b. Es hat eine um 1 kleinere Dimensions- 
summe als das Paar a, 6. Man kann also wieder vollstandige Induktion an- 
wenden. Da der Schnitt von a’ und 6 gleich dem Punkt D ist, ist der Satz 
somit vollstandig bewiesen. 

Es erweist sich notwendig, die bisher aufgestellten Saitze noch ein wenig zu 
verschirfen. Der H. R. kann ebenso wie der endlich dimensionale euklidische 
Raum zu einem projektiven Raum erginzt werden, indem man jeder Richtung 
einen unendlich fernen Punkt der Geraden zuordnet, welche die gegebene Rich- 
tung besitzt. Man kann dann auch in der iiblichen Weise unendlich ferne 
ebene Raume einfiihren. Ferner kann man den Begriff der Orthogonalitat 
aweier Riume auf den Fall ausdehnen, dass einer von ihnen oder beide ins 
Unendliche fallen; denn schon bei eigentlichen Raumen ist ja die Aussage der 
Orthogonalitét eine Aussage tiber ihre Schnitte mit dem Unendlichfernen. 

Unter einer Rotation um eine unendlich ferne endlichdimensionale Achse a 
verstehen wir jede isometrische Abbildung des H. R. in sich, welche jeden Punkt 
von a einzeln festhalt und jeden vollstdndigen Orthogonalraum zu a in sich 
tiberfihrt. 

Zur formalen Abrundung fiihren noch wir den leeren Raum } = 0 ein, dem 
wir die Dimension — 1 zuschreiben. Unter einer Rotation um b = 0 kann jede 
tsometrische Abbildung des H. R. in sich verstanden werden. Die einzigen Rota- 
tionskérper mit ) = 0 als Achse sind der ganze H. R. und die Nullmenge. 

Nach Einfihrung dieser Begriffserweiterungen kénnen wir die Hilfssitze, die 


Ein vollstandiger Orthogonalraum zu a spannt mit a den ganzen H.R. auf. 
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zunichst eine Reihe von geometrischen Hilfssitzen tiber Rotationskérper zu- 
sammenstellen. 

1. Hitrssatz: Ist a; eine k-dimensionale Achse des Rotationskorpers ®, so ist 
auch jeder endlichdimensionale ebene Raum a, von einer Dimension | > k, welcher 
a, enthdlt, ebenfalls Achse von §. 

Dieser Satz folgt unmittelbar aus der Definition einer Rotationsachse. 

2. Hitrssatz: Die Rotationskérper bilden einer Mengenkérper. Brwets: Der 
Rotationskérper # habe die Achse a, der Rotationskérper © die Achse 6b. 
Man bilde den Verbindungsraum c von a und 5, d. h. den ebenen Raum kleinster 
Dimension, welcher sowohl a als 6 enthalt. Seine Dimension ist bekanntlich 
nicht grésser als die Summe der Dimensionen von a und b vermehrt um eins. 
Nach dem 1. Hilfssatz ist ¢c Achse sowohl von ®t als auch von ©. Hieraus folgt 
offenbar, dass auch ® + S und wenn KR < © ist, auch SG — M simtliche Rota- 
tionen um c gestattet. 

3. Hitrssatz: Sind a und 6 Achsen des Rotationskérpers R, so ist auch der 
Verbindungsraum von a und b Achse von R. 

Der Satz ist eine unmittelbare Folge des 1. Hilfssatzes. 

4. Hiurssatz: Sind a und b zwei Achsen des Rotationskérpers R und haben 
a und b einen nichtleeren Durchschnitt d, dann ist auch dieser Achse von §. 

Wir beweisen die etwas schdrfere Aussage: 

4’. Hitrssatz: Jede Rotation wm den nichtleeren Durchschnitt d zweier ebener 
Réaume endlicher Dimension a und 6, laésst sich aus endlich vielen Rotationen um 
a und b zusammensetzen. 

Wir schliessen durch vollstandige Induktion nach dem Wert von s = p + q 
worin p und qg die Dimensionen von a und 6 bedeuten. Da man annehmen 
kann, dass keiner der beiden Raume a und 6 in dem anderen ganz enthalten ist, 
ist der kleinstmégliche Wert von s gleich 2 und wird fiir p = gq = 1 geliefert. 
a und 6 sind dann zwei sich in einem Punkt D schneidende Geraden. Hier 
kann man folgendermassen schliessen. Durch eine Rotation B um 6 entstehe 
aus a die Gerade a* # a. Sie geht ebenfalls durch D hindurch. Jede Rotation 
A* um a* lasst sich vermége einer zugehérigen Rotation A um a in der Form 
A* = B ‘AB darstellen. Wenn also der zu beweisende Satz fiir das Achsenpaar 
a, a* richtig ist, so auch fiir das Paar a, 6. Sei nun 6 der zwischen 0 und 37 
gelegene Winkel zwischen a und 6. Der Winkel zwischen a und a* variiert mit 
a* zwischen 0 und 26. Ist 26 = 42, so kann man a* so wahlen, dass der Winkel 
zwischen a und a* gleich 37 ist. Ist aber 25 < 32, so wihle man a* so, dass der 
Winkel zwischen a und a* den maximalen Wert 25 annimmt und verfahre dann 
mit a und a* ebenso wie vorher mit a und 6. Nach endlich vielen Schritten 
dieser Art gelangt man schliesslich zu einem Achsenpaar a’, 6’ durch D, welche 
aufeinander senkrecht stehen und die wesentliche Eigenschaft haben, dass jede 
Rotation um a’ baw. 6’ sich aus endlich vielen Rotationen um a und 6 zusam- 
mensetzen lisst. Ebenso wie im dreidimensionalen euklidischen Raume be- 
weist man aber, dass jede Rotation um den Schnittpunkt zweier einander 
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senkrecht schneidender Geraden a’, 6’ aus héchstens drei Rotationen um a’ 
und 6’ zusammengesetzt werden kann. 

Sei nun p + g > 2. Ist die Dimension d des Durchschnitts ) von a und 6 
positiv, so ist jede Rotation um 6 (und um so mehr jede Rotation um a bzw. b) 
vollkommen bestimmt durch ihr Verhalten in einem vollstandigen Orthogonal- 
raum’ © zu d. Die Schnitte von a, b, ) mit O haben der Reihe nach die Di- 
mensionen p’ = p — d, q’ = q — d,d’ = 0. Die Summe s’ = p’ + q’ = 
p+q — 2d ist kleiner als s = p+ gq. Man kann also vollstiindige Induktion 
anwenden. 

Sei jetztd = 0,p > 0,¢g >0,p+q>2. Der Durchschnitt ist ein einzelner 
Punkt D. Ohne Beschrinkung der Allgemeinheit kann angenommen werden, 
dass p > 1 ist. Man verstehe unter a’ irgend einen durch D gehenden Un- 
terraum von a von der Dimension p — 1. Durch eine passende Rotation um 6 
kann a in eine Lage a* gebracht werden derart, dass a und a* genau a’ zum 
Durchschnitt haben. Die Dimensionen von a und a%* sind beide gleich p und 
die ihrer Schnitte mit einem vollstandigen Orthogonalraum zu a’ gleich 1. Da 
die Summe der letzteren gleich 2 ist, kann nach den friheren Uberlegungen 
behauptet werden, dass jede Rotation um a’ aus Rotationen um a und a* 
zusammengesetzt werden kann. Ebenso wie im Falle p = qg = 1 ist jede 
Rotation um a* durch Rotationen um a und 6 ausdriickbar. Im Ganzen kann 
man jede Rotation um a’ aus Rotationen um a und b zusammensetzen. Nun 
betrachte man das Achsenpaar a’, 6. Es hat eine um 1 kleinere Dimensions- 
summe als das Paar a, 6. Man kann also wieder vollstandige Induktion an- 
wenden. Da der Schnitt von a’ und 6 gleich dem Punkt D ist, ist der Satz 
somit vollstindig bewiesen. 

Es erweist sich notwendig, die bisher aufgestellten Satze noch ein wenig zu 
verschirfen. Der H. R. kann ebenso wie der endlich dimensionale euklidische 
Raum zu einem projektiven Raum erginzt werden, indem man jeder Richtung 
einen unendlich fernen Punkt der Geraden zuordnet, welche die gegebene Rich- 
tung besitzt. Man kann dann auch in der iiblichen Weise unendlich ferne 
ebene Raume einfiithren. Ferner kann man den Begriff der Orthogonalitat 
zweier Raume auf den Fall ausdehnen, dass einer von ihnen oder beide ins 
Unendliche fallen; denn schon bei eigentlichen Raumen ist ja die Aussage der 
Orthogonalitaét eine Aussage iiber ihre Schnitte mit dem Unendlichfernen. 

Unter einer Rotation um eine unendlich ferne endlichdimensionale Achse a 
verstehen wir jede isometrische Abbildung des H. R. in sich, welche jeden Punkt 
von a einzeln festhalt und jeden vollstdndigen Orthogonalraum zu a in sich 
tiberfiihrt. 

Zur formalen Abrundung fiihren noch wir den leeren Raum } = 0 ein, dem 
wir die Dimension — 1 zuschreiben. Unter einer Rotation um db = 0 kann jede 
isometrische Abbildung des H. R. in sich verstanden werden. Die einzigen Rota- 
tionskérper mit ) = 0 als Achse sind der ganze H. R. und die Nullmenge. 
Nach Einfiihrung dieser Begriffserweiterungen kénnen wir die Hilfssitze, die 
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" Ein vollsténdiger Orthogonalraum zu a spannt mit a den ganzen H.R. auf. 
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bisher aufgestellt worden sind, in dem Sinne verschdrfen, dass sie ihre Richtigkeit 
behalten auch in dem Falle, wo einer oder mehrere der darin auftretenden Raéume 
unendlich fern sind oder den leeren Raum darstellen. Die einfachen Beweise fiir 
diese Erweiterungen, die man iibrigens ein Falle } ¥ 0 als Grenzfiille der 
bisherigen Aussagen betrachten kann, kénnen dem Leser iiberlassen bleiben. 
Wenn die Hilfssdtze im Folgendem zttiert werden, sind ste stets in der verschirften 
Form gemeint. 

BEMERKUNG: Beim Beweise der Hilfssitze ist von der Unendlichdimen- 
sionalitét des H. R. kein Gebrauch gemacht worden. Die Hilfssitze 1, 2, 3 
und ihre Beweise gelten wortlich in endlichdimensionalen euklidischen Raum. 
Beim Beweise der Hilfssitze 4 und 4’ ist davon Gebrauch gemacht worden, 
dass der Verbindungsraum von a und 6 nicht der Vollraum ist. Unter dieser 
Voraussetzung gelten diese Satze also auch in euklidischen Raéumen endlicher 
Dimension. 

Wir sind jetzt im Stande, eine Ubersichtiiber die Gesamtheit aller Achsen eines 
Rotationskérpers R° zu gewinnen. Es sei a eine Achse von & kleinstméglicher 
Dimension. Jch behaupte, dass es nur eine Achse dieser Art gibt. Giibe es 
nimlich zwei verschiedene, a, 6 so miisste ihr Durchschnitt nach dem (in der 
verscharften Form angewandten) 4. Hilfssatz auch Achse sein und diese hitte 
eine kleinere Dimension als a und 6. Wenn im Folgenden von der Achse ay 
von & die Rede ist, so ist stets diese Achse kleinster Dimension gemeint. Nun 
sei a eine von ay verschiedene Achse. Da der Durchschnitt von a und ay 
keine kleinere Dimension haben darf als ay , muss a die Achse ag enthalten. 
Zusammenfassend kénnen wir also behaupten: 

5. Hitrssatz: Unter den Achsen eines Rotationskérpers gibt es genau eine, 
Ay , kleinster Dimension. Man bekommit alle iibrigen, indem man alle méglichen 
ebenen Réume endlicher Dimension durch ay legt. 

Von nun an sollen nur beschrénkte Rotationskérper  betrachtet werden. 
Die Achse ag eines solchen ist offenbar im Endlichen gelegen. 

Eine wesentliche Rolle spielt in spateren Uberlegungen eine Grésse, die jedem 
beschrinkten Rotationskérper § zugeordnet werden kann und die wir als 
Radius px von ® bezeichnen. Wir verstehen darunter die obere Grenze der 
Distanzen der Punkte von R von der Achse an. Wir beweisen jetzt eine Reihe 
von wichtigen Aussagen iiber py . 

6. Hitrssatz: Sei a irgend eine Achse des Rotationskérper R. Dann ist die 
obere Grenze der Distanzen der Punkte von R von a wieder gleich px . 

Der Beweis folgt fast unmittelbar aus der Definition von pg unter Zuhilfnahme 
des 5. Hilfssatzes. 

7. Hivrssatz: Fiir irgendwelche Rotationskérper Ri, Re, ---, Km gilt dre 
Formel 


(1’) PRi+R2+---+HRm = Max. (pg, , PR2» "°°" PRm)> 





8 ®t sei weder der Vollraum noch die Nullmenge. 
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Es geniigt, den Satz fiir m = 2 zu beweisen. Man verstehe unter » den 
Verbindungsraum von dy, und dy,. Er ist Achse sowohl von &; als auch von 
j,, also auch von %i + Rez. Die Gréssen py, , px, , Px,+9, Sind nach dem 6. 
Hilfssatz der Reihe nach gleich den oberen Grenzen der Distanzen der Punkte 
von 9:1, Re, Ri + Re von v. Hieraus ersieht man unmittelbar die Richtigkeit 
des Satzes. 

Im Hinblick auf unsere spitere Absicht, Rotationskérper zur Approximation 
allgemeiner Punktmengen zu beniitzen, kann die Frage aufgeworfen werden, 
ob der letzte Hilfssatz auf den Fall unendlich vieler Summanden ausgedehnt 
werden kann. Dies ist mit einer gewissen Einschrdnkung richtig. Es gilt 


nimlich der wichtige 
8. Hinrssatz: RR; Ri, Re, --- seien lauter Rotationskérper und 


(2) R= M+ Mt+--- 
Ferner set 
@) lim py, = 0. 
Dann ist 
(1) px = Max. (ox, , Px, -** ): 
Bewels: Da ®, in ® enthalten ist, hat man 
(4) Px, S Px (n = 1,2,---), 
also 
(5) oo = Max. (pg, , Px.» -*: ) S Px- 


Wir haben nachzuweisen, dass in (5) das Gleichheitszeichen giiltig ist. Wir 
betrachten zunachst den Spezialfall, dass ® eine abgeschlossene Kugel 8 ist. 
Wir schliessen indirekt. Sei oo < py. Auf Grund des 7. Hilfssatzes ist fir 


jedes S, = Ri + Ret --- +R, (nm = 1,2,---) 





(6) pe, = % (n = 1,2,---). 
Man wiahle den Index n = 7; so, dass 
(7) o1 = Max. (ppia,+11 P&a,+29 °°) <= 


ausfallt. Dies ist wegen lim,.«. py, = 0 mdglich. Aus der Definition von 
pe, und (6) ergibt sich offenbar, dass man innerhalb & eine abgeschlossene 
Kugel &, vom Radius p; wihlen kann, welche keinen Punkt von ©,, enthiilt. 
Die Ungleichung (7) lehrt nun, dass man nach Streichung von 91, Re, --- , Rn, 
diesen Prozess wiederholen kann. Man erhilt so eine Reihe von Indizes 


Ny <M Sees, 


* Ohne diese Zusatzforderung ist der Satz unrichtig. Man kann z. B. die Einheitskugel 
mit abzihlbar vielen Kugeln vom gleichen Radius p < 1 iiberdecken. 
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und eine Reihe von Kugeln 
KR > Ri > Re > --- 


derart, dass ®; keinen Punkt von ©,; (¢ = 1, 2, --- ) enthalt. Nun gibt es 
aber einen Punkt 7’, der in allen Kugeln §; enthalten ist. Dieser ist wegen 
der Abgeschlossenheit von & in & enthalten, aber in keinem ®, (n = 1, 2, ... ), 
Dies steht mit R = Ri + Re + --- im Widerspruch. 

Wir gehen nun zu einem allgemeinen Rotationskérper ®t tiber. Wir nehmen 
wieder an, dass a9 < py ist. Dann existiert ein Punkt 7 in ®, dessen Distanz 
von dy grésser als oy ist. Durch T lege man den vollstandigen Orthogonalraum 
bh’ zuadg. 5’ ist wieder ein H. R. und wir wollen jetzt allein in ihm operieren. 
Seine Schnitte mit 2; 91, Ite, --- sollen der Reihe nach mit MR’; Ri, Re, -- 
bezeichnet werden. Das sind lauter Rotationskérper von h’ und fir ihre 
Radien gelten die Ungleichungen: 


PR > 90, 
PR, Ss PRn Ss %- 


Nun andern wir unsere Punktmengen &’; 91, 92, --- noch ein wenig ab. 
§’ ist in h’ rotationssymmetrisch in Bezug auf den Schnittpunkt M von §’ und 
dy und enthalt den Punkt T mit einer Distanz MT > oo. 8’ sei die abgeschlos- 
sene Kugel von }’ mit dem Mittelpunkt M und dem Radius MT. Wir bilden 
zunichst die Durchschnitte 


R= KR, N= KR, Ri = KR, --- 


Offenbar ist, wieder in }’, 


lA 


py > oo und py.’ S px, S oo (n = 1,2,---) 


und ausserdem 
R= Ri + Re + --- 


Jetzt ersetze man 9” durch §’ und ebenso jedes #, durch den Kérper Si. 
welcher entsteht, indem man zu jedem Punkt von 9%, seine ganze Verbindungs- 
strecke mit M hinzufiigt. Wir haben dann 


R =S+G2+--. 
und offenbar, in 6’, fiir jedes n 
Pe, = PRn’ » 
also 
pe, < Radius von &’. 


Damit sind wir auf den Fall der Kugel zuriickgefiihrt worden. Der Satz ist 
somit vollstandig bewiesen. 

Mit Hilfe des eben bewiesenen Hilfssatzes sind wir im Stande, den Begriff 
des Radius auf allgemeinste beschrankte Punktmengen auszudehnen. Wit 
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vollsiehen die Verallgemeinerung in zwei Etappen. Zunichst gehen wir zu 
einer Erweiterung des Systems der Rotationskérper iiber, die spiater eine wich- 
tige Rolle spielen wird, den Rotativkérpern. Wir nennen die Punktmenge % 
einen Rotativkérper, wenn sie die Darstellung 


(8) P= Rt+tRt+-:--: 
zulisst, worin die R, alle Rotationskérper bedeuten, deren Radien der Bedingung 


(9) lim Px, = 0 


no 


yeniigen. Man kann tibrigens stets zu einer Darstellung mit paarweise punkt- 
frenden Summanden iibergehen. Zu diesem Zweck setze man 


RX = (Ri + Me + --- + Mn) — (Mi + Me +--+ + Raa). 


Dann ist * wieder ein Rotationskérper und 


(8*) P=R+R+---, 

und ausserdem 

(8**) RR: = 0 (i # k). 
Da ®% in R, enthalten ist, ist py* S py, , also auch 

(9*) lim px, = 9. 


Der 8. Hilfssatz legt es nahe, als Radius py eines Rotativkérpers 8 den Wert 


uw = Max. (px, » PR» ***) 


zu definieren. Allerdings muss bewiesen werden, dass eine andere erlaubte 
Darstellung von $ mit gleichen Eigenschaften 


P= GtGt-- 


zu dem gleichen Wert fiihrt. 
BeweEis: Man setze 


v = Max. (pe, , pe, -*° ). 
Wir fihren die Durchschnitte Tj = 2;-G, ein. Es bestehen die Gleichungen 


R = L Ta, SG = Le Te. 


Nach dem 8. Hilfssatz ist 


pe; = Max. (0s;,, Pte, °** )s 


Pe, = Max. (ps1. » PTok» **" ). 
Es ist also 


pe = v = Maximum aller pg,, . 
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Fiir den Bereich der Rotativkérper gelten folgende Satze: 

9’. Hitrssatz: Summe und Durchschnitt zweier Rotativkérper ist wieder ein 
Rotativkérper.” 

10’. Hitrssarz: Sind $1, P2, --- , Bm irgend welche Rotativkérper, so ist 


(10’) PBit+Bet-+-+Bm = Max. (pg, »PBe, °°" 1 PBm)- 
9. Hitrssatz: Sind $1, Be, --- trgend welche Rotativkérper und 


lim Ps, = 0 
dann ist auch 
P=R+tB+--- 


ein Rotativkérper. 
10. Hitrssatz: Unter den Voraussetzungen des 9. Hilfssatzes ist 


(10) Pz = Max. (pg, » PBo» °° * ). 


Die Beweise dieser vier Hilfssitze sind so einfach, dass sie dem Leser iiber- 
lassen werden kénnen. 

Nun gehen wir daran, den Begriff des Radius auf allgemeinste Mengen des H. R. 
auszudehnen. Sei J eine soleche Menge. Gibt es Rotativkérper, welche 
umschliessen, dann haben deren Radien eine untere Grenze pp. Wir nennen 
sie den Radius von IN. Wenn es keine umschliessenden Rotativkérper gibt, so 
schreiben wir It den Radius pn = © zu. 

Mit dieser Definition gelten ganz allgemein die folgenden beiden Hilfssitze: 

11’. Hitrssatz: Sind M1, Ms, --- , tm irgend welche Mengen, so ist 


(11’) PR, +2 +--+, = Max. (pm, » PM. ,°°* ) PU) 
11. Hitrssatz: Sind Dt, Me, --- irgend welche Mengen, welche der Be- 
dingung 
lim PN, = 0 
gentigen, so ist 
(11) PM;+Me+--- = Max. (px, » PM, °°° ). 


Die Beweise ergeben sich fast unmittelbar aus den entsprechenden fir Rota- 
tivk6rper. 

BEMERKUNG: Bei der Definition des Radius einer Menge 9 haben wir die 
Gesamtheit der 2 umschliessenden Rotativkérper herangezogen. Es ist jedoch 
zu bemerken, dass man sich bei der Bestimmung von ps» auf solche Rotativ- 
kérper beschrinken kann, welche Vereinigung einer Folge von Kugeln mit gegen 
Null konvergierenden Radien sind. In der Tat lasst sich offenbar ein be- 
schriinkter Rotationskérper 9 mit endlich vielen Kugeln iiberdecken, deren Radien 


10 Die Differenzbildung kann aus dem System der Rotativkérper hinausfihren. 
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alle kleiner als eine vorgeschriebene Zahl p’ > py sind. Hieraus folgt, dass man 
auch einen beliebigen Rotativkérper $ durch einen aus Kugeln ausammen- 
gesetzten umschliessen kann, dessen Radius beliebig wenig von py abweicht. 
Hieraus ergibt sich die Richtigkeit unserer Bemerkung. 


3. Das Inhaltsmass bei Rotations-und Rotativkiérpern 


Bei der Einfiihrung des Inhaltsmasses im H. R. lassen wir uns von dem 
Cavalierischen Prinzip als heuristischem Prinzip leiten. Wir betrachten zu- 
nichst nur Rotationskérper R, welche den beiden Forderungen geniigen: 

1. Forperuna: Wie bisher soll ® ein beschrankter Kérper sein. 

2. ForperuNG: Ein Meridianschnitt von 8, dessen Dimension um 1 grésser 
ist als die Dimension k der Achse ag, ergebe eine messbare”’ Punktmenge. 
Spiter wird eine Verschdrfung dieser Forderung eingefiihrt werden (siehe 8. 827). 

Wir betrachten jetzt Meridianschnitte von allen méglichen Dimensionen. Sie 
sind offenbar alle messbar. Sei a; irgend eine Achse, ihre Dimension sei 1 2 k. 
Schneidet man § mit einem zu a, parallelen Raum gleicher Dimension im 
Abstand r = 0,” so erhalt man eine Punktmenge, welche im Allgemeinen (d. h. 
abgesehen von einer Menge vor r-Werten vom Masse Null) messbar ist.'* Dies 
ist bekanntlich eine Folge der Messbarkeit eines Meridianschnittes von der 
Dimension 1 + 1 (1 = k). Der Inhalt des Schnittes heisse yy(r). 

Es ist 


(12’) uy(r) = 0 und identisch Null fiir r > py 
und 
(12”’) ug(r) integrabel fiir r = 0. 


Zwischen yy(r)-Funktionen, deren Indizes um eins differieren, besteht, wie eine 
einfache geometrische Uberlegung zeigt, die Rekursionsformel 


(13’) up(r) = 2 I f pe ((r? + 7°)*) dr (l>k+ 1). 


Fiihrt man die Integrationsvariable ¢ = (r° + 7’) statt 7 ein, so nimmt (13’) 
die Gestalt 


ce) 14 
(13”) wale) =2f uk) (pin sop Gek+t) 
an. Setzt man wy(x) = wy(a*), so wird (13) noch einfacher: 

yes dé 1 
(13) one) = | o'® <a 


_ 


" Messbarkeit von Punktmengen und Integrabilitét von Funktionen soll stets im 
Lebesgueschen Sinne gemeint sein. 

® Ist 1 = 0, so ist ein Punkt im Abstande r vom Rotationszentrum zu wahlen. 

’ Im Fall 1 = 0 ist als Mass eines Punktes der Wert 1 zu nehmen. 

‘Es ist zu beachten, dass als obere Grenze des Integrals (13”) der endliche Wert px, 
als obere Grenze des Integrals (13) px genommen werden kann. 
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Der Zusammenhang zwischen w(x) und wy (x) wird also durch eine Abelsche 
Integraloperation vermittelt. Denkt man sich wy(r) bekannt, so ist wy (2) 
Lésung einer Abelschen Integralgleichung. Nun ist mit wR(x) auch wy(zx) fiir 
jedes m = k integrabel. Auf der andern Seite ist bekannt, dass die Abelsche 
Integralgleichung (13) bei gegebener integrabler linker Seite, héchstens eine 
integrable Lésung haben kann.” Hieraus folgt: 

1. Sarz: Kennt man in der Reihe u(r), ug (r), --- auch nur ein einziges 
Glied, so kann man alle tibrigen berechnen. 

Wir versuchen jetzt eine naturgemasse Definition der Inhaltsgleichheit zweier 
Rotationskérper ® und © zu finden. Wir setzen voraus, dass beide den zwei 
aufgestellten Forderungen geniigen. Wir kénnen also die Funktionenfolge 


(14’) ug(r), ug (7), °° 
und eine entsprechende fiir © (die Dimension von ae sei 1) 
(14’’) uo(r), ue (r), hie 


bilden. Liasst man sich von dem Cavalierischen Prinzip als heuristischem 
Prinzip leiten, so kommt man dazu, ® und © sicherlich dann als inhaltsgleich 
zu betrachten, wenn fiir irgendein m 2 k, 1 


(15) ug(r) = wo(r) (r = 0) 


ist. Wir wissen, dass dann diese Gleichheit fiir jedes m = k,l besteht. Spitere 
Uberlegungen (vgl. 6. Satz, S. 830) werden zeigen, dass es zweckmissig ist, nur 
dann die K6rper als inhaltsgleich zu betrachten, wenn diese Gleichheit zutrifft. 
Wir definieren also: 

DEFINITION: St und S werden inhaltsgleich genannt, wenn fiir ein m 2 k, | 
die Gleichung (15) besteht. 

Nun zur Definition des Inhaltsmasses. Die Rotationskérper welche unseren 
beiden Forderungen geniigen, bilden einen Mengenkérper; denn sind ® und © 
zwei K6érper des Systems, so ist der Verbindungsraum » von ag und ae Achse 
von ® + © und falls R < S ist, von SG — KR. Ein Meridian durch » liefert 
also messbare Schnitte sowohl mit ® als auch mit S, also auch mit S + WR. 
Wenn aber iiberhaupt ein Meridian eines Rotationskérpers eine messbare 
Schnittmenge liefert, so auch, wie man sehr leicht einsieht, der kleinstméglicher 
Dimension. 

Sind ® und S punktfremd, so ergibt die Betrachtung eines gemeinsamen 
Meridians von der Dimension m 


were(r) = ue + wo(r). 


Es besteht also Additivitat. Man kénnte deshalb daran denken y5(r) fiir genii- 
gend hohes m als Inhaltsmass von § einzufiihren. Bei Zusammensetzungen 





© Wie tiblich, sind hier auch wie im Folgenden zwei Funktionen als identisch anzusehen, 
wenn sie sich nur in einer Menge von Argumentwerten vom Mass Null unterscheiden. 
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von Kérpern fiihrt aber die Notwendigkeit, den Index m zu variieren, zu 
Schwierigkeiten. Man kann sie jedoch in folgender Weise umgehen. Man 
betrachte die Gleichung (13’’) und setze darin fiir 1 den Wert k ein. Die linke 
Seite ist die durch den Rotationskérper bestimmte Funktion uy (r). Wir wollen 
jet die tiber die 2. Forderung fiir R auf 825 hinausgehende Annahme machen, 
dass es eine Funktion un ‘(r) gibt, welche in jedem Intervall (e€, ©) mit einer 
positiven unteren Grenze ¢ integrabel ist und die Abelsche Integralgleichung (13’’) 
lost. Es gibt dann, wie man der Theorie dieser Gleichung entnimmt, genau eine 
Lisung. Da uy(r) = 0 ist fiir r > py, so ist offenbar auch 


” (r) = O(r > p») 


Nun setze man in (13) 1 = k — 1. Die linke Seite ist jetzt wieder eine be- 
kannte, in jedem Intervall («, ©) integrable Funktion. Wir nehmen wieder 
an, dass eine in jedem Intervall (e, ©) integrable Lésung uy °(r) existiert. Mit 
dieser setze man den Prozess fort. Wir nehmen an, dass man so bis zu einer 
Funktion uy(r) = wy(r) gelangen kann. Alle Funktionen uy '(r), u(r), - 
uy(r) verschwinden fiir r > py. Rotationskérper, welche ausser der Forderung 1. 
auf 8. 825 diesen zu wy (r) fiihrenden Prozess gestatten—er stellt eine Verschdrfung 
der 2. Forderung auf S. 825 dar—wollen wir als reguldre Rotationskérper be- 
zeichnen. Wir definieren nun: Die einem reguldren Rotationskérper R zugeordnete 
Funktion uy(r) nennen wir Inhaltsmass oder kurz Inhalt von R. 

Die Zweckmassigkeit dieser Definition ergibt sich aus den im Folgenden 
bewiesenen Satzen. 

2. Sarz: Sind R und S zwei reguldre Rotationskérper mit leerem Durchschnitt, 
sist auch R + S regular” und es ist 


(16) ugre(") = uy(r) + we(r). 


Brwets: Sei a eine gemeinsame Achse von # und © von der Dimension m- 
Ein Meridian von ® + © durch a ist gleichzeitig Meridian von ®, © und 
R + S und liefert eine messbare Schnittmenge mit KR +S. Es ist offenbar 


ugre(r) = ug(r) + ner). 
Daraus folgt offensichtlich die Regularitiét von ® + ©S und die Gleichung 
ug+e(r) = ug(r) + ue(r). 


3. Satz: Sind R und S reguldre Rotationskérper, R in S enthalten, dann ist 
auch © — R regulér und 


(17) He-e(T) = ue(r) — uel). 


Der Beweis ist ebenso zu fiihren wie beim 2. Satze. 
Derinition: Unter einem reguldren Rotativkérper $8 verstehe man evnen 


‘6 Bei nicht leerem Durchschnitt braucht ® + © nicht reguiir zu sein. 
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Kérper, welcher sich als Summe 


(18) P= R+MWt+-::: 


von héchstens abzdhlbar vielen reguldéren Rotationskérpern darstelstellen list, 
welche den beiden Bedingungen 


(19) RR. = 0 (i ~ k); 
(20) lim Px, = 0 
gentigen. 
Wir bilden die Reihe 
(21) o(r) rue Me, (7) + Mg, (7) +.:-- 


Sie konvergiert fiir jeden positiven Wert vonr. In der Tat ist bei vorgegebenem 
Wert 7>o > O nur fiir endlich viele n der Radius py, > 70, nur endlich viele 
Reihenglieder in (21) sind also fiir r > 7» von Null verschieden. Es gilt nun der 
4. Sarz: Die Summe a(r) in (21) Gndert sich nicht, wenn man von der Dar- 
stellung (18) zu einer neuen von den gleichen Eigenschaften ribergeht. 
Sei etwa 


(18’) P=Git+ Geot+-::-. 

Die ©, seien alle regular, ferner 

(19’) SiG. = 0 (i £ k), 
(20’) lim pe, = 0. 


Man bilde die (21) entsprechende Reihe 
(21’) t(r) 3g He, (7) ¥ ue, (7) F-::. 


Wir fiihren die Bezeichnung T% = R;G, ein. Alle Durchschnitte T&T, sind 
wieder (nicht notwendig regulaire) Rotationskérper und es ist 


(22) PI ix s PR: » PI iz < PS, - 


Sei nun 7 ein irgendwie vorgegebener positiver Wert. Man bestimme zuge- 
hérige Indizes n; und nz so, dass fiir jedes 


(23) n>m der Radius px, < 7% 
und fiir jedes 

(23’) n> mn der Radius pe, < ro 
ausfallt. Wegen (22) ist auch 


(24) px;, < 70 fir «+ >m oder k> me. 
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Man setze ferner 


ng cy 

(25) Ui = R — DD Ta = pe Tix (¢ = 1,2, ---, m) 
k=1 k=ngtl 

und 

(25’) VY. = Sr — 2 Ta = a Tix (kK = 1,2, ---, mg). 


Die Ul; und &, sind Rotationskérper. Wendet man auf (25) und (25’) den 8. 
Hilfssatz des vorigen Paragraphen an, so erhalt man wegen (24) 


(26) Pu; < To (7 _ 1, 2, ee » m1), 
(26’) PRE < 7 (k = 1, 2, te Ne). 
Nun betrachten wir wieder die Reihen (21) und (21’), aber nur fiir r = 1% 


Aus (23) und (23’) folgt 


(27) o()'* > jete,° yrs > aay) b eR. 


Sei a eine gemeinsame Achse der Rotationskérper 
Tix (2 oi 1, 2, Bt »m > k = 1, 2, ital , M2), Ui (7 ae 1, 2, .* ,»™), 
Vi (k = 1, 2, --- , me). 


Sie habe die Dimension m. Die Gleichungen (25) und (25’) zeigen zusammen 
mit den Aussagen (26) und (26’), dass fiir r = 7 die Gleichungen bestehen 


n2 
(28) um,(r) = > ug,.(r) fir r& ro (¢ = 1,2, ---,m) 
1 
und 
ni 
(28') ue, (7) ca } uri, (7) fir r = 1 (k rad 1, 2, ee , a), 
t=1 
also 
ni n2 
> ug (r) = > ue, (r) (r = 1), 
also auch 
ni n2 
> ugar) = D we, (r) (r = 1). 
i=1 k=1 


Dies zeigt im Verein mit (27) die Identitét von o(r) und 7(r). 

Der bewiesene Satz gibt uns die Méglichkeit, den Jnhalt fir einen beliebigen 
reguléren Rotativkérper % zu definieren: Wir verstehen darunter die Rethensumme 
(21) fur eine Zerlegung von § in reguldre Rotationskérper Rn (n = 1, 2, --- ), 
welche die Eigenschaften (19) und (20) besitzen. 
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Im Bereich der regulairen Rotativkérper gilt der allgemeine Satz: 


5. Satz: Set $1, Bo, --- eine héchstens abzdhlbare Reihe von reguléren Rota- 
tivkérpern und es set 
(29) PP. = 0 (t ¥ k) 
und 
(30) lim Ps, = 0. 
Dann ist auch 

B= B+ Bt. 

ein reguldrer Rotativkérper und es ist 
(31) ug(r) = wg, (r) + ugi(r) + ---. 


Der Beweis ist auf Grund des Bisherigen so einfach, dass er dem Leser iiber- 
lassen bleiben kann. 

Wir beweisen nun einen Satz, der eine Rechtfertigung der von uns einge- 
fihrten Definition des Inhaltsmasses enthalt. 

6. Satz: Wenn zwei reguldre Rotativkérper und QD gleiches Inhaltsmass 
haben, dann. kann man zwei Rotativkérper J’ und Q’ finden derart, dass 


Pp’ < §, a <<); py_y = 0, Po-g' = 0 


ist und dass $3’ und Q' eine Zerlegung in paarweise Cavalierisch gleiche Rotations- 
kérper mit gegen Null honvergierenden Radien gestatten. 
BreweEts: Seien 


$ a8 }» Ri, OD aa 3 S; 
Zerlegungen von $ bzw. Q in regulire Rotativkérper 


RR. = 0, GG.=0 (#k); lim px, = lim pe, = 0. 


no 


Nach Voraussetzung ist 


(32) wai(r) + uge(r) +--+ = we,(r) + we,(r) +---. 
Nun wahle man irgendwie eine monoton abnehmende Nullfolge 


m>Te> +++3 lim r, = 0. 
Sei c; eine gemeinsame Achse aller 2%, , S, , deren Radien mindestens gleich r: 
sind. Ihre Dimension sei m,. Unter 9, verstehe man den Teil eines solchen 
: . e e / 
RM, , dessen Punkte von ¢; mindestens die Distanz 7; haben. Sonst sei R, = 0. 
Analog sei ©, zu erkliren. Aus (32) folgt 


(32') mai(r) + wga(r) + +++ = wet(r) + weir) + + 
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und hieraus offenbar fiir die endlich vielen von Null verschiedenen ®,, , S;, 


(3') woe; (r) + meg r) + -+- = weir) + wey) + ---, 
oder 
(33) Me +o;+--- = Mel+e:4.... 


Die Summen RF = >, R., St = DS! bestehen nur aus endlich vielen 


Gliedern, Sie stellen also Rotationskérper dar, welche nach (33) Cavalierisch 
gleich sind. Nun verfahre man mit $1 = $B — Ri, Qi = Q — Sf ebenso 
wie bis jetzt mit % und Q, indem man ausserdem 7; durch rz ersetzt. Man 
erhilt so zwei neue Cavalierischgleiche Rotationskérper R2 und G; u.s.w. Man 
zeigt nun leicht, dass 


P=RM+MWw+---, A=S+S4... 


Rotativkérper mit Zerlegungen derselben darstellen, wie sie unser Satz fordert. 

Zum Schluss dieser Paragraphen stellen wir die Frage, welche Funktionen 
u(r) als Inhalt von reguléren Rotativkérpern in Betracht kommen und inwiefern 
sie ein nichtarchimedisches Gréssensystem bilden. 

Drei notwendige Bedingungen fiir u(r) sind aus der Entstehung einer Inhalts- 
funktion sofort abzulesen: 

(a) u(r) verschwindet ausserhalb eines endlichen Intervalls (0, ro) (ro > 0) 

(b) In jedem Intervall (€, ro) mit positiver unteren Grenze « ist u(r) integrabel. 

(c) u(r) ldsst sich in der Form 


u(r) = > bn(r) 


schreiben, wo die un(r) die Eigenschaften haben: 

1) Die un(r) erfiillen die Bedingung (a) und (b) mit Werten ro , welche mit 
wachsendem n gegen Null konvergieren. 

2) Eine gentigend hohe von n abhdngige Iteration der Abelschen Operation ver- 
wandelt u»(r) in eine nichtnegative Funktion. 

Wir wollen zeigen, dass die drei Bedingungen auch hinreichend sind. Die von 
un() geforderten Eigenschaften zeigen, dass es Inhalt eines reguliren Rota- 
tionskérper ist, dessen Radius nicht grésser ist alsr¢ . Nun denke man sich eine 
Folge von parallelen Hyperebenen” 6; , b2, --- des H. R. im Abstandd = Max. 
(1, ++ ). Den zu u,(r) gehérigen Rotationskérper kann man vollstandig 
zwischen die n** und die (n + 1) Hyperebene unterbringen. In dieser Lage 
sind die Rotationskérper also alle punktfremd, ihre Vereinigung ergibt aber 
einen Rotativkérper mit dem Inhalt u(r). 

Die Funktionen, welche die ersten beiden Eigenschaften und von der dritten 
nur die Teileigenschaft 1 haben, bilden einen Funktionenmodul; denn die Prozesse 


"Unter Hyperbene ist ein vollstandiger Orthogonalraum zu einer Richtung zu 
verstehen. 
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der Addition und Subtraktion fiihren aus ihm nicht heraus. Die nicht identisch 
verschwindenden Funktionen, welche auch noch die vollstindige dritte Eigen- 
schaft haben, bilden ein Teilsystem des Moduls, das wohl noch die Addition, 
aber nicht die Subtraktion gestattet. Man kann sie als die positiven Gréssen 
des Moduls bezeichnen. Sind u'(r) und u(r) zwei Funktionen des Moduls 
und »’(r) — u'(r) in dem eben angefiihrten Sinne positiv, so wird man y'(r) kleiner 
als y*(r) bezeichnen. Diese Relation ist transitiv. Es kann aber nicht be- 
hauptet werden, dass je zwei Elemente des Moduls vergleichbar sind. Inner- 
halb des Systems der positiven Gréssen gilt offenbar das Archimedische Axiom 
nicht, sie bilden ein nichtarchimedisches Zahlsystem. 


4. Inhalte von Koérpern, welche sich durch regulaére Rotativkérper 
approximieren lassen 


Sei Mt eine Punktmenge des H. R. von folgenden Eigenschaften: Zu jedem 
positiven e gebe es einen Jt umschliessenden reguléren Rotativkérper derart, 
dass $ — I einen Radius pg_m < «hat. Wir zeigen, dass einer selchen Punkt- 
menge in naturgemidsser Weise ein Inhalt zugeschrieben werden kann. Sei 
nimlich Q ein zweiter zu demselben e gehériger regulérer Rotativkérper. Dann 
unterscheiden sich $ und Q von dem Durchschnitt BOQ um Punktmengen, 
welche beide in (8 — Mt) + (Q — M2) liegen. Aus pgm < € und po» < « 
folgt aber 


(34) PB-M+(O-mM) < 


Wegen der Regularitét von $ und OQ kannn man den Durchschnitt BQ als 
Summe von Rotationskérpern schreiben, deren Radien gegen Null konvergieren 
und deren Meridianschnitte messbar sind. Eine geniigend hohe Partialsumme 9 
erfiillt also die Bedingungen 


(35) R< BQ, ppo-w < «, 


(36) Die Meridianschnitte des Rotationskérpers R sind messbar. 
Aus (34) und (35) folgt, dass 


(37) Ppa <€ pom <e 


ist. Hieraus folgt leicht ug(r) = wa(r) fir r > € und geniigend hohes m und 
hieraus auch 


(38) ug(r) = wo (r) fiirr > « 


Wahlt man eine Nullfolge von e-Werten und zugehérigen , so konvergieren 
die ug(r) wie (38) zeigt, gegen eine wohlbestimmte Grenzfunktion um(r), die von 
den Willkirlichkeiten des Prozesses unabhingig ist. Wir nenen sie den Inhalt 
von IM und M selbst messbar. 

Mit dieser Definition lasst sich eine Inhaltslehre aufbauen, welche mit der 
im endlichdimensionalen euklidischen Raume viele Ziige gemein hat. Ein 
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fundamentaler Satz der letsteren ist allerdings nicht erfiillt: Die messbare 
Mengen bilden nicht einen o-K6rper; denn schon V ereinigung und Durchschnitt 
yon reguléren Rotationskérpern braucht nicht regular zu sein. 

In einer spateren Arbeit soll eine Erweiterung der hier gegebenen Inhalts- 
theorie vorgenommen werden, in welcher die messbaren Mengen einen Kérper 
bilden. Ausserdem soll dort eine Integrationstheorie entwickelt werden. 
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THE PLATEAU PROBLEM FOR NON-RELATIVE MINIMA 
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1. Introduction 


The Problem of Plateau is to find minimal surfaces bounded by a given 
Jordan curve T.’ Previous investigations’ have yielded the existence of mini- 
mal surfaces bounded by I which are either absolute or relative minima in 
area. The purpose of this investigation is to discuss minimal surfaces which 
are not relative minima, i.e., of the minimax type.” 

The present paper restricts itself to a special class of boundary curves I 
(described in §3), a class which includes curves having a continuously turning 
tangent line. Under this restriction, we shall prove the following two main 
results: 1) af T bounds two minimal surfaces which are proper relative minima, it 
bounds at least one minimal surface which is not a proper relative minimum (main 
theorem I, §4); 2) the Morse relations apply to the Plateau problem (main theorem 
II, §10). 

The method we shall use for obtaining minimal surfaces, following Douglas, 
Radé, Courant, is to consider them as extremals for the Dirichlet functional 


D{q] = ; / / (q.. + q;) du dv among all surfaces q(u, v) bounded by I’. To prove 


the first main theorem, let q’, q’’ be two minimal surfaces bounded by I which 
are proper relative minima. We shall join q’, q’’ by a connected set C,, of 
surfaces bounded by I on which the least upper bound of D[q] is the smallest 
possible. It is then to be expected that there exists on C a minimal surface q 
of the required type. This would be a consequence of the minimizing character 
of C, if D[q] were a continuous functional; but D[q] is merely lower semi-con- 
tinuous. The major part of this paper consists in overcoming this difficulty. 
The methods, especially theorems 4, 5, §§7, 8, developed to prove the first 





1 See Radé, ‘On the Problem of Plateau,’’ Ergebnisse der Math., vol. II, no. 2, 1933, 
for an excellent account of the Plateau problem and for further literature. 

* Besides the references in Radé, |.c., see Douglas, ‘“‘The Problem of Plateau,’’ Bull. 
Amer. Math. Soc., 1933, pp. 227-251, and ‘‘Minimal Surfaces of Higher Topological Struc- 
ture,’ Annals of Math., vol. 40, 1939, pp. 205-298; Courant, ‘‘Plateau’s Problem and 
Dirichlet’s Principle,’’ Annals of Math., vol. 38, 1937, pp. 679-724; Shiffman, ‘“The Problem 
of Plateau for Minimal Surfaces which are Relative Minima,’”’ Annals of Math., vol. 39, 
1938, pp. 309-315. 

**In the meantime, a paper by Morse and Tompkins on minimal surfaces of general 
critical type has appeared in the April 1939 issue of this journal. Summaries of their 
paper and of the present paper are contained in the March and April issues respectively 
of the Proc. Nat. Acad. Sc., 1939. 
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main theorem suffice to establish the Morse relations for the Plateau problem. 
These well-known relations between k-caps and connectivity numbers have been 
established by Morse in his abstract theory of the variational calculus.’ It 
remains to show that on each k-cap there is the desired minimal surface. 
Theorems 4, 5 serve to prove this. 

Sections 2, 3, 4 are of an introductory nature; the considerations in §$5, 6 
form the basis of our method; a fundamental deformation theorem is obtained 
in §7, and the variational condition in §8; finally, the proofs of the main theo- 
rems are completed in §§9, 10. 


2. The Relevant Spaces 


According to the Riemann-Weierstrass theorem,‘ a minimal surface q = q(u, v) 
(in vector notation) is characterized by the fact that isometric parameters u, v 
can be found, i.e., EH = G, F = 0 where E = qu, F = qua, G = q;, and that 
in terms of these parameters Quu + Qo = 0. The potential character of q(u, v) 
implies that the following function 


¢(w) = (qu — ig) = E-— G — UF 


is an analytic function of w = u + w, and the isometric character of u, v that 
¢(w) = 0. The procedure which we shall adopt for obtaining minimal surfaces 
is to consider them as the extremal surfaces for the Dirichlet functional* 


bel =3 [ [+ @)audo=} [fh + a dudv=} ff (at + hei) rare 


Let T be a given closed Jordan curve in space. We shall consider surfaces 
q = q(u, v) defined over the unit circle of the (u, v)-plane, which map the 
boundary of the unit circle monotonically on I’, and which are continuous and 
have piecewise continuous first derivatives. It will be convenient to use polar 
coérdinates 7, 6, and the surfaces will hereafter be expressed as q = q(r, 4), 
r $1. Since the Dirichlet functional is invariant under conformal mapping 
of the unit circle into itself, we may specify that three given points of the unit 
circle, 6; , 62, 63, be mapped by each surface q(r, @) into three given points A, 
B,C off. Surfaces q(r, @), mapping the boundary r = 1 monotonically ontoT, 
satisfying this three point condition, and having a finite Dirichlet integral, will 
be called admissible surfaces. 

It is convenient to classify the admissible surfaces into various spaces. The 
distance | q. — q2| between two surfaces qi(r, @), q2(r, 8) will be chosen as the 


* Morse, ‘‘Analysis in the Large,’’ notes of the Institute for Advanced Study, 1936-37, 
and ‘‘Functional Topology and Abstract Variational Theory,’ Annals of Math., vol. 38, 
1937, pp. 386-448. 

*Cf. Radé, lc. note 1, chap. 2. 

* Cf. note 2. 
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uniform distance, 


| qu a: qe | = = | qr, 0) Me | qe(r, 6)|. 


The space of all admissible surfaces with this metric will be denoted by &. 
The subspace of all potential surfaces in © will be designated by f. Finally, 
the space of all surfaces q of S, or of $, for which 


D{q] Ss N 


will be designated by Gy , or $y , respectively. 

It is clear that Sv C Sw if N < N’,and S = Dow Sy = limy.. Sy. Simi- 
larly for By. We shall now obtain some topological properties of the various 
spaces introduced. Whereas neither © nor Gy is compact, we have 

Lemma 1. The set By is compact and closed. 

Proor. Because of the three point condition and of the boundedness of the 
Dirichlet integral for all q in By , the boundary values of all the surfaces in By 
are equicontinuous.” From any sequence of surfaces in By, therefore, there is a 
subsequence qi, q2,--: whose boundary values converge uniformly. Since 
41, 42, --- are potential surfaces, they converge uniformly to a potential sur- 
face qg, and the derivatives of q:, q2,--- converge uniformly in any closed 
interior domain to the corresponding derivatives of q. This limit potential 
surface q has boundary values lying monotonically on I and satisfies the three 
point condition. Furthermore, in any circle of radius p < 1, 


D,<,{q) = lim D,<olqn] S lim Dlqn] S N; 

letting p> 1, D{[q] < N.’ Hence q belongs to $v and the lemma is proved. 

The connection between the spaces S and , Gy and Py , is given by 

Lemma 2. There is a deformation of S in itself which leaves each element of 
fixed and deforms S into f. This same deformation deforms the space Gn in 
itself into the space $n. Analytically expressed, there is a deformation f(q, t) 
defined and continuous for all q in S and allt in 0 S t S 1 such that 

1) f(q, t) ts in S, f(g, 0) = q, f(g, 1) = F where j is in $. 

2) f(g, t) = 4 fg ts in $. 

3) D[f(q, t)] is a monotonically decreasing function of t for fixed q;° in particular, 
D{q] = D{aq). 

Proor. Let §(r, 6) be the potential surface with the same boundary values 
as q(r, @). Define the surface f(q, ¢) for 0 < t S 1 by 


fq, =qg+t§-—qg) =G+(1 - dF 


6 Cf. Courant, l.c. note 2. 

’ This states the lower semi-continuity of D[q] in the class of potential surfaces. 

® Condition 3) is equivalent to the statement that f deforms Gy in itself into the 
space By . 
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where ¢ = a(r, 6) — G(r, @) and ¢ has the boundary values zero. By the mini- 
mizing character of potential surfaces with regard to the Dirichlet functional,’ 


Dif(a, t)] = Dig] + (1 — #)*D{x). 


Thus, D[f(q, #)] is a monotonically decreasing function of ¢ for fixed q, and 
f(q, t) satisfies properties 1), 2), 3) of the lemma. Furthermore, by a well- 
known theorem of potential theory, f(qn , tn) > f(q, t)” if qn > q, tn — t, so that 
{(q, #) is continuous. The lemma is proved. 

Thus, the spaces © and Gy may be replaced by the essentially equivalent 
but simpler spaces $ and By. 


3. The Admitted Boundary Curves I | 


In the course of the proofs in §§5, 6, certain restrictions will be made on the i 
boundary curves [ considered. Let q(@) be a proper representation of I’, where 
q(@) is a vector function and @ varies over the circumference of the unit circle. | 
We shall require the following two conditions to be satisfied: a 
1) q(@) is of bounded variation. | pea 
2) There is a 6 such that dq(@)-dq(¢) = 0 (a product of vectors!) for all 0, ¢ for wo 
which |@—@| <6." . 
In 2), the relation dq(@)-dq(@) = O shall be shorthand for the statement that 
there is an e such that 


[q(0 + A@) — g(9)]-[a@ + 46) — a()] 2 0 


for all positive (or all negative) A@, A¢@ for which | A@| < «, | Ad| < «. 

If the two conditions above hold for any one representation of I’, they hold 
for all representations of ©. For, any other representation )(@) of T is obtained 
from q(@) by performing a continuous monotonic transformation \(6) of the 
circumference of the unit circle into itself, »(@) = q(A(@)). Condition 1) is 
satisfied for 9(@) because of the monotonicity of \(@). Condition 2) is satisfied 
for )(@) for 6’ and e’ defined as follows: by virtue of the continuity of \(@) there 
isa 4’ such that | @ — @| < 6’ implies | \(6) — A(P) | < 4, and an é’ such that 






On ee oe 







































* Among all surfaces with given boundary values, the potential surface j has the smallest 
Dirichlet integral. If ¢ is any surface with boundary values 0 and with a finite Dirichlet 
integral, and if qe = § + ef then D{q.] = DG]. Since 


D{qe] = D[G] + 2 « DiG, 5] + @DI{z], 









the usual argument yields D{@, ¢] = 0, so that 
D{ge] = D[G] + &D{é). 






This applies to the case ¢ = q — 9; for then 
D{g] S ((D{q))* + (D{q))#)* s 4DIq). 


”—+ means convergence according to the metric of ©, i.e., uniform convergence. 
| @ — | means the length of the shorter arc joining @, ¢. 
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| A@| < e’ implies | \(@ + A4@) — A(@)| < «. Because of the monotonicity of 
(8), the quantities \(@ + A@) — r(), AW + Ad) — A) have the same sign if 
Ad, Ad have. Thus, for |@—¢| < 6’ and A@, A¢ both positive and < ¢’, we have 


[y(@ + 48) — y(4)]-[n@ + 4d) — 9@)] 
= [q(A( + A8)) — g(A(9))]-[4(A@ + A¢)) — q(A@))] = 0 


since q(@) satisfies 2). Hence 1), 2) hold for (8). 

Let q(@) be a given proper representation of I’ satisfying 1), 2). Let y(r, 6) 
be any surface in % with boundary values (6), so that »(@) = q(A(6)). Define 
the functional 7(y) as the smallest number 7 such that there are points 6, ¢ 
for which | 6 — @| = 7 and | A(@) — A@) | = 6. Then 9(@) satisfies properties 
1), 2) with 7(y) replacing 6. We prove now that the functional 7r(y) is lower 
semi-continuous. Let y,(r, #) — y(r, 8), so that A,(6) — (6) uniformly; let 
7(Yn), at least for a subsequence, converge to ¢t. Then, there are points 6, , ¢, 
for which | 6, — ¢n| = 7()n) and | An(@n) — An(n) | = 6. Choosing a subse- 
quence so that 0, — 0, d, — ¢, we get 


|@-—@|=# and |A(d) — A@)| = 4. 


Hence 7(y) S$ ¢. This proves that 7(y) S lim 7(y,), so that 7(y) is lower semi- 
continuous. 
Now, if 9 varies over the closed compact set By , 7(y) has a minimum which 
is attained for some surface q. Since 7(q) must be positive, this proves 
Lemma 3. There is a positive number ty such that any surface y(r, 0) in Bw 
has boundary values (0) satisfying: 


dy(6)-dy(¢) = 0 for all 0, ¢ for which |@ — | < mw. 


The meaning of conditions 1), 2) for the curve I is easily obtained. It is 
well-known that 1) asserts that T has a finite length; 2) states that the angle 
between the directed secant lines q(62) — q(@) and q(¢2) — q(¢:) is at most 
m/2 if | 0: — | < 6 and if 6 — 6, and ¢2 — ¢; are both positive and < «. 
Define the directions of T at the point @ as all the limits of the directed secant 
lines q(62) — q(@:) for 02 > 6, as 6. — 6, 6; > 6. Then 2) above implies that 
the angles formed by all the directions of T at the point @ and all the directions 
at ¢ are at most 7/2 if 6, ¢ are sufficiently close to each other, | @ — ¢| < 6. 
In particular, choosing @ = 4, all the directions of T at a point make angles 
at most 7/2 with each other. Conversely, if the directions of I at a single 
point make angles less than 7/2 with each other, and if this is true for every 
point of I’, then T has the property 2). 

Curves I which have a continuously turning tangent line have the properties 
1), 2). These curves are everywhere dense in the space of all Jordan curves. 

In what follows, we shall limit ourselves to boundary curves satisfying con- 
ditions 1), 2). 
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4. The First Main Theorem. The Maximum-minimum problem 


Our first major result is the following 

Main ToeorEM I. Let T be a Jordan curve satisfying properties 1), 2) of §3. 
If T bounds two minimal surfaces which are proper relative minima, it bounds at 
least one minimal surface which is not a proper relative minimum. 

The character of the minimum refers to the Dirichlet functional D(q) in the 
space ©. The surface q is said to be a proper relative minimum if D(y) > DJ{q) 
for every surface y in © different from q in a sufficiently small neighborhood of q. 

The proof of the theorem is based on a typical maximum-minimum problem. 
Let q’, q’’ be the two minimal surfaces which are proper relative minima. Let C 
be a closed connected set in S containing both q’ and q’’. Define d[C] to be 
the least upper bound of D{q] for all q in C; define d to be the greatest lower 
bound of d[C] for all such sets C: 

d{C] = Lu.b. D{q] 


inc 


d = g.Lb. d[C]. 
Cc 


The problem is to find a minimizing closed connected set Cn , i.e., one for which 
d{C,] = d, and then to establish on C,, the existence of the required minimal 
surface. 

THEOREM 1. There exists a minimizing closed connected set C» containing q’, 
q’ and contained completely in $. 

Proor. It is necessary to establish the existence of a set C for which d[C] 
is finite. This will be shown in the course of later work for the special class of 
boundary curves I considered here, and is contained in theorem 3, p. 825. 

By lemma 2, any closed connected set C containing q’, q’’ is deformed into 
such a set C all of whose surfaces belong to % and for which d[C] S d[C]. There- 
fore the lower bound of d[C] for all C’s in $ is also d. Let e+ 6, «2 
be a minimizing sequence of closed connected sets in $ containing q’, q’’, 
d(C") + d. Now for sufficiently large N, d[C"] < N for all n, and all the C” 
are contained in By. Construct the set C,, of all limit elements q of C “a 
of all q such that for some subsequence C”’, q"* — q where q”’ belongs to C”’. 
By virtue of the compactness and closedness of By , Cm is a closed connected 
set in $y containing q’,q’’. Because of the lower semi-continuity of the Dirichlet 
functional, 


D{q] < lim D[q™'] = lim d[C™] = d, 


so that d[C,] < -d. But d[C,] = d, and finally d[(C,,] = d. Theorem 1 is 
proved. 

Note that d is larger than both D[q’] and D[q’’]. For, Cn contains surfaces 
y' and y”’ arbitrarily close to q’ and q’’ respectively for which D[y’] > D{q’, 
D{y”] > D{q’’]. 

There remains for the proof of the first main theorem to establish the existence 
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of the required minimal surface on C,,. This would be an immediate conse- 
quence of the minimum character of Cn if D[q] were a continuous functional. 
But DJ[q] is only lower semi-continuous. The remainder of this paper is devoted 
to overcoming this difficulty. We shall prove (theorem 6, §7) that there is a 
minimal surface q on C,, for which D[q] = d. 


5. Expression for the Dirichlet Functional 


Let q(r, 0) be a potential surface defined over the unit circle with continuous 
boundary values q(@). Let a, , 6, be the Fourier coefficients of q(@). A simple 
calculation shows that the value of the Dirichlet integral of q(r, 6) over a circle 
of radius p, p < 1, is 


D,<,lq] ~~ 2 A n(a‘, + bn)p. 
This will be transformed into an expression containing the boundary values q(é) 
directly. In the course of the work, the assumptions stated in §3 will be made 
concerning q(@). These are finally enumerated at the end of this section in 
the statement of theorem 2. 
Assume that the boundary values q(@) are continuous and of bounded varia- 
tion. Then, by integration by parts, 


2r 2r 
a, = 2 [ q(@) cos n6dd = — if sin n6 dq(6), 
T 0 TN JO 


bn 


rp | q(@) sin n6d6 = _ [ cos né dq(6). 
wJo, mn Jo 


Substituting in the nee for D,<,[q], one obtains 


D,<,lq) = X Ti cos n(@ — ¢) dq(@) - da(¢) 
# ; If be cos n(0 — 6) [aa dq(¢) 


because of the uniform convergence of >~%_: p’"/n cos n(@ — ¢). This series 
is easily evaluated, 


i) 


2n 
p ‘ere 1 
p> - cos n(0 ¢) = 3 log | 2) ve 4p" ain (9 — $)’ 


by noting that it is the real part of oe z"/n, where z = pe *e-#) | Therefore 
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since i | dq(6) dq(@) = 0. Performing the limit p > 1 under the intégral sign, 


we get” 





1 
D{q] = i / i log Sin? 30 — g) 40 @9(@). 


To justify this passage to the limit, we shall make the additional assumption 
that there is a r such that dq(@) dq(¢) 2 0 for all 6, ¢ for which | 6 — ¢| < + 
(see §3). In this, and in what follows, the integration variables 6, ¢ are to 
vary along the circumference of the unit circle, and | @ — ¢| is to mean the 
shorter are between 8, ¢. 

a) Suppose that 





/ / log = 3 ‘a = a dq(@) dq(¢) 


is finite. We have 
1 


D,slal = ix( ff-e+ ff )s* (= ae outta 
2p 








|9—@|2r |@—@| <r 


In the first integral, the limit » — 1 can be performed under the sign of inte- 
gration. In the second integral, since (for p sufficiently close to 1) 


1 
) + sin? 40 — 4) 





< 106 Fin oes 





0 < log Baad 
(5 


and dq(6) dq(¢) = 0, the limit can also be performed under the sign of integra- 
tion.” Hence 


Dla] = lim D.splal = 7 | f toe 557% —z dato) ale). 


b) Suppose that ‘i is finite. Then 


J] 106 ssy%q—gy da dato) 


wS|9—-| <r 





“at sof J rs sf ap ) a 3(6 — 4) reaped 





st J 7 (5°) ae mao 





* The integral is to be understood as an improper Riemann-Stieltjes integral. 
* This is a well-known theorem of Lebesgue. 
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since the integrand is positive for |@ — ¢| <u < r. Because Dj{q] is finite, 
this last limit is finite. Letting u tend to zero, it follows that 


1 
/ / log sin? 3(6 — 4) dq(6) dq(¢) 


|9-—¢|<r 


is finite. Hence 





i | / log sin? i ig) dq(@) dq(¢) 


is finite, and case a) shows that it is equal to D[q]. 

This completes the proof of 

THEOREM 2. Let q(r, 0) be a potential surface defined over the unit circle with 
boundary values q(@). If 

1) q(@) is continuous and of bounded variation, 

2) there is a r such that dq(0) dq) = 0 for all 0, for which | 60 — $| < r, 
then 


1 ] 
Dial = = | [lot <r yq—z aul) ao. 


6. Lemmas on the Dirichlet Functional 


Let q(r, 6), y(r, 0) be two surfaces in $ with boundary values q(@), »(6) respec- 
tively. Since the boundary values both lie monotonically on I, 


y@) = q(@) where © = X(8) 


and A(@) is a monotonic function of 6. Because the boundary values q(6) or 
y(@) may be constant along some arcs, one must be careful concerning the 
definition of \(@). (@) is defined in the following manner. To a given point 
Q of T correspond two arcs, 0’ S$ @ < 6,0’ <0 < @”, which may reduce to 
points, consisting of all the values of 6,0 for which q(6) = Q, »@) = Q respec- 
tively. If these ares reduce to points [6’ = 6,0’ = 0”, set A(6’) = 0’; if 
0’ = 0”, set \(@) = O’ for all 6 in 6’ < 6 S 0”; if 6’ = 6”, (8) has a jump at 
tt / 

pe (0 — 0’) if 0 ¥ 0”, 
0’ #0”. The inverse 6 = un) to@ = (6) is defined in a similar way. 

We shall join q(r, @) and y(r, @) by a ‘linear’ family of surfaces all belonging to 
%. Define y.(r, 0) for ¢ in the interval 0 < ¢« < 1 as the potential surface with 
boundary values y.(@) given by“ ».(@) = q(@) for® = (1 — €)@ + e(8), or, 
equivalently, by 


6’ of amount 90” — 0’; and finally, set \(@) = 8’ + 


y.(@) = (6) for © = 6+ (1 — 6u(6). 


4 If \(@) has a jump at 6’ from @’ to @’’, then y.(@) is defined to be constant, = 4(9’), 
in the interval from (1 — €)@’ + «@’ to (1 — €)@’ + <0”. 


By 
€ 


nl — 








th 
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The boundary values of .(r, 9) lie monotonicallly on I’, the three point condition 
is satisfied, and Yo(r, 8) = q, w:(r, #2) = ». Now 





1 1 
Divd = x | f toe ra — yy ACO) ana), 


By first setting @ = (1 — €)@ + ed(0),® = (1 — €)d + eA), and then® = 
& + (1 — €)u(6), & = « + (1 — €)u@), we obtain 


1 
Diy.) = ran °F sin? $[(1 — 2) @ — @) +A@) — A@))I 





dq(6) dq(¢) 


and 


- 1 
Dil = a | | 8 ree ow FUT oGw Tae 1 





The first of these forms we shall use for the interval 0 S e« S 3, the second for 
}Se31. 

We shall first prove the following lemmas. 

Lemma 4. DJ[y,] is finite and is a continuous function of « in the closed interval 
0se81. 


Lemma 5. D[y.] SN +5 — Nag 8 sais" , where N is the larger of the two quanti- 


ties D[q] and D[y], tw ts the Bu 9 defined in lemma 3, and L is the length of the 
curve I’. 

Lemma 6. D[y.] has a continuous derivative with respect to ¢ in the open interval 
0<e<1,and 


= 3 d(@) — 8 — AW) — 9) 
i -E Sf tan $[(1 — 6)(@ — ¢) + €(A(6) — AP))] dq(@) dq(¢). 


Proors. We have, in the interval 0 < ¢ S }, 


oni= 2 J] 


\0—g)2r 


1 
+ / / log sin? 4[(1 — €)(0 — @) + €(A(6) — A(P))] dq(@) da(¢) 


|@—g| <7, 








where + = J, + Iz is so small that dq(@)dq(@) 2 Oif|@—¢|< 7. Now, for 
\9-$| <7 < 2/3 and0 S « S }, 


2|(1 — e-)(@ — ¢) + AO) — AW) |Z 4 — 2/6 -o|2zt|O-4I, 
and this same expression is S$ 3(| 0 —@| +4-2r) S$7—3|0—¢|. Therefore 





weap 3 c 
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sin? 3{(1 — e)(@ — ¢) + e(A(6) — A@))] = sin’ 3(6 — ¢), and the integrand in 
I. is 

1 
sin? 3(0 — ¢) 


+ log 4 cos’ 4(6 — ¢) S log 


< log = log 


1 
sin? 3(0 — ¢) 


1 
sin? 40 — 6) + log 4. 
Since D{q] is finite, this is integrable, and J, is finite and depends continuously 


one. Of course, J; is finite and depends continuously on «. Therefore D[y,] is 


finite and continuous for 0 S « S$ 3. Furthermore, 








2s - | i E 8 A =p + log | dq(@) dq(¢) 
|9@-—@| <7 
1 
$2 alls 6 TES H ana 
4 
a < zz 
tf € ite a rar", dq(@) dq(¢) +2 “ log 4 N ry “Io 08 = re 
a $|27 
also, r ' 
L 1 L 4 
ns : 6 sin? 37 s ig 8 cnt ty 


by an easy estimation. Hence, for0 S e S 3, 


Ry 4 
Reversing the réles of q(@) and y(6@) throughout the whole proof shows that 
D{y.] is finite and continuous for 4} S ¢ < 1, and is estimated by the same quan- 
tity above. Lemmas 4, 5 are piedad, 
To prove lemma 6, let ¢ lie in the interval a S$ e S$ 1 — a, wherea>0O. _Differ- 
entiate under the integral sign, 


mle -2( Jf + JI) 


|@—|2r = |0- | <r 








(6) — 8 — (Ag) — 4) 
dq(@) dq(¢) 
ian — )@— 4) + ea@) — xo ™ 
= I 3 co I 4. 
The differentiation is valid for the integral J;. For I, , set 


n= ff = 00-9) + = ro) 
de JJ tan 0 — 9@- 4) + Q® — r@))I 


dA) — 0— A) — 4) a 
T= )@— 4) + ea@ — ro WO 











® For, if en > ¢ then D[y.,] + D[y.] by the Lebesgue theorem. 


t 
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The first factor in the integrand is uniformly bounded, and so is the second 
factor. For, we have 


r@) —9- (Xd) —@) | 
(I — €)@ — 4) + €A@) — X@)) 


ie So 
tt ays. x = NG) 








so that 


1, @-e-Q@)-¢ 111 


= (1 — 2) — $) +e(A@) —AG)) ~e el—e 


This establishes the validity of the differentiation under the integral sign as 





: 
a 


well as the continuity of — ina Se«<1-— a.” Since a > 0 is arbitrary 


this proves lemma 6. 

Thus, q(r, 8) and y(r, 6) have been connected by a path contained entirely in 
$y for sufficiently large N. This important result was required in the proof of 
theorem 1, §4, and wiil be stated in the form of a theorem. 

THEOREM 3. Any two surfaces in $ can be connected by a continuous path of 
surfaces contained entirely in By for sufficiently large N. 

We shall now determine the appearance of the graph of D[y,] as a function of « 
for surfaces y sufficiently near q. This requires knowing where maxima and 
minima occur. 

Main Lemma 7. Let q(r, 0) be any surface in $ whose boundary values are not 
constant on any arc," and y‘”(r, 0) n = 1, 2, --- , a sequence of surfaces in B 
tending to q(r, 0). Let yS”(r, 6), 0 < € < 1, be the linear path joining q and y” 
as in lemmas 4, 5, 6. Then, 


dD{y:””] 
de 


—0 asn-o 
exe (n) 


implies that D[y{t)] — D[y]. 
Proor. Let q(@), y°”(0) be the boundary values of q(r, 4), y‘”(r, 6) respec- 
tively, so that 


y"(@) = q(0) for @ = A‘”(6). 


The condition y‘”(r, 6) — y(r, 6) implies that \‘”() converges uniformly to 6." 
In particular, the jumps of \‘”(@) tend to zero asn — ©. In all the following 
calculations, the superscript n is omitted. 





‘6 The integrand for (D[ve,se] — D[e])/Ae, a S ¢ S 1 — a, remains bounded as Ae — 0, 
so that the limit can be taken under the integral sign. 

‘7 The lemma also holds without this restriction, as follows by a slight modification of 
the proof. 

‘8 For, let \"(8) — (0), at least for a subsequence. From y”(@) — q(6) uniformly 
and y"(\"(@)) = q(6) we get q(A(@)) = q(@). Since q(@) is not constant on any arc, \(6) = @. 























846 MAX SHIFFMAN 

















We have 
| Diva - Dil= 3 ( ff + Sf) 
. [6-27 = |0- | <r 
ee ie ey sin’ $(6 — ¢) :) 
aan OB sin? HC — @ — 4) AG) — AG] OM = e+ Ie, and 
) hog sin? 4(6 — ¢) 
My I¢ re i dq(@) dq(¢) {lo og 6-6? 
i ” wanes 
wie | 1 
ee F°8 cost HU — J — 6) + <0@ — NO))] 
ba (@ - 9) \ = 
ee as + 18 tan? HU — )@— 6) Fea@ —rxe@pyy tet Ie 
| en The only singular integral is J. An estimate of Js is, because of log x < z, 
Hy kl ce s 
eerie 15 te ead. eH = 90 =o) FAW = KO WOME ~ K 
ipl Now, 
: dD[y.) _ 
ml -2([f + [f) 
|9@—¢|2r |9—¢| <7 


(6) — 6 — (AAP) — ¢) 
tan 3[(1 — €)(6 — ¢) + e€(A(@) — A))] 





dq(6) da(¢) = Is +14, and 











(1 — 6) — ¢) + (Al) — AY) 
I= —-— 
at I. tan al (1 pa €) (0 +. ¢) + €(A(8) we d(¢))] dq(6) dq(¢) 
A se 
7 Le tan 3[(1 — «)(6 — o) + €(A() — A))] dq(6) dq(¢) 
re 1 
Or, K = 2 a wi Say 


Reinsert the superscript n, and let n > ©, so that \°”(@) converges uniformly 
to 6. We have: 


I; 0, 


cae 0-6 
Js z df. tan} —@) dq(6) da(¢), 











and 


nd 


¢) 
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so that 


18@-o|< 


eS) 


further, 





i [0 sin’ we ¢) dq(6) da(¢), 


2 
|@—|<r — ¢) 


1 
hog | te > dale) aa), 


|@—¢| <r 





and 


—¢ 
lim J3 S lim K = 5 Edi - mz ae O) dq(@) dq(¢). 


Combining all these, we obtain 


sin” a5 ¢) 


fim (Dine) — Dio) sg ff tog FE — © aie) ance) 
|9@-—¢d|<r 
1 
‘<-- .* € cost Fo — @) 100) 2000) 
+2 : If ian 3G g) 19 dale). 


This inequality being true for all sufficiently small 7, let 7 0. Then 
lim (D[y{%&)] — D[q]) < 0. 

But 
lim (D[yit)] — D{q]) = 0 

because of the lower semi-continuity of the Dirichlet functional. Hence, 
lim (D[yit] — D[q]) = 0, 


and lemma 7 is proved. 

Lemma 7 will be stated in e, 6 form. 

Lemma 8. Let q(r, 0) be any surface in $3 whose boundary values are not constant 
on any arc.” For any n, there is an a and a 6 such that, if y(r, 0) is any surface 
in the 6-neighborhood of q, then 

| aD[n.] 


Diy. = Dla] + 1 implies |" 





= a. 


‘* The lemma also holds without this restriction. See note 17. 
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Proor. If this were not true for some n, there would exist a sequence y‘” in 


(n) 
Bq for which Dost] 2 Dla] + ybut 7H] 50, This contradicts 





lemma 5. 

Lemma 9. Let q(r, 6), 9, a, 6, y(r, 0) be asin lemma 8. There are two possibili- 
ties for the graph of D{y.] as a function of ein0 Se S1: 

1) If D[y] < D{q] + n, then D[y.] ts always less than D[q] + 7. 

2) If D[y] = D[q] + 1, there is an é such that D{y.] < Di[q] + nfor0 < « <é, 


Div.) = Dial + 1, and PM > afore se <1. 


Proor. By lemma 8, any maxima or minima in the graph of D[y.] must occur 
below D[q] + ». Hence 1) follows. Further, if Diy] = D[q] + 1, there is a 
first ¢ such that D[y.] < D[q] + n for 0 S « <é@, D[yz] = D[q] + n, and Diy] = 
D{q] + nforé < ¢« <1. By lemma 8, we must have oS 


ZainéSeSl. 


This completes the proof of lemma 9. 


7. The Fundamental Deformation Theorem for the Dirichlet Functional 


We shall now construct, on the basis of lemma 8, a deformation which is 
fundamental for our proof. It is contained in 

THEOREM 4. Let q(r, 0), n, a, 6 be as in lemma 8. There is a deformation 
S(v, t) of the space 3 in itself, defined and continuous for ally in $B and0 St S 1, 
which has the following properties: 

1) f(y, 0) = 9; denote f(y, 1) by §. 

2) f(a, t) =a; fo, =n ly—q| 26. 

3) Let|y — q| S 6. If Diy] S Dig] + 2, then Dif(v, t)] S Dla] + 1; of 
D{y] > D{q] + n, then D[f(y, t)] is a decreasing function of t for some interval 
0 st <f” and thereafter, i < t < 1, D[f(v, )] S Diq) + 2. 

4) Let|y»-—q| =6—068,0S0S1. Theneither D[§] S D[q] + nor Di] < 
D\y] — oa. 

Proor. Define f(y, t) = y(r, 0) if |y — q| = 6;if|y —q| = 6 — 06,08 
o <1, define f(y, t) = y:_.:(r, 6), where y.(r, 8) is the linear path joining q and 
(and e = 1 — ot). This certainly satisfies 1), 2). It satisfies 3) by virtue of 
lemma 9. For property 4), suppose that D[§] = D[y,.] > D{q] + 7. By 
lemma 9, dD[.]/de = a for 1 — o S € S 1, and we have 


Div) — Din = f é wiv) a. > a 


Thus, 4) is satisfied. Since f(y, ¢) is certainly continuous in y and ¢ (because 
y. is), the theorem is proved. 





207? depends on y. 











S 
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8. The Variational Condition 


Theorem 4 is the basic theorem which will eliminate the difficulty that the 
Dirichlet functional is merely lower semi-continuous and not continuous. In 
this section, we shall exploit the extremal character of minimal surfaces with 
regard to the Dirichlet functional. We shall obtain a deformation of the 
neighborhood of a surface in §, not a minimal surface, which diminishes the value 
of the Dirichlet integral. 

Let q(r, 8) be a surface in $ not a minimal surface. The analytic function 
ow) ofw=u+tw= re” defined by 


o(w) = (qu — iq)” = >” (« ~ i a) 


cannot vanish identically, for otherwise q(r, 6) would be a minimal surface. The 
expression for ¢(w) in polar coérdinates gives 


—e 
wo(w) =r («. oée a) = (r°q; — qs) — 2irarge , 
so that 
—2rqqe = S[w'¢o(w)]. 
Hence the potential function —2rq,qg cannot vanish identically.” Let (q, 7) 


be a point in polar coordinates where —2rq,q¢ ~ 0. Suppose, for example, that 
it is negative at this point, 


—2rq,qe = —8b <0 at (g,7). 


Now, if ) is a surface in § close to q, the derivatives of y at (g, y) are likewise 
close to the corresponding derivatives of q at (q, y). Choose 6 such that for any 
surface y in the 6-neighborhood of q, 


—2rv,ye S —4b at (q, v). 
Define the surface y,(r, 0) by 
y.(r,¢) = y(r, 0) for ¢ = 0+ aA(r, 9), 
where A(r, 8) is equal, in the annular ring (1 + g)/2 < r < 1 to the Poisson kernel 
1 fag 
2x r? — 2rq cos (6 — y) + ¢ 


with pole at (q, y), and in the circle r S (1 + q)/2 to some function which has 
bounded derivatives and attaches continuously to the values of the Poisson 
kernel on r = (1 + q)/2. Let M be the bounds of the derivatives, 





1 
|r, | < M,|%»| < M, and limit « to lel < oy: 





*' Otherwise w’¢(w) = constant = 0 by setting w = 0. 
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Map the unit circle conformally on itself so that the points 6, + eA(1, 6;:), 6 + 
eA(1, 62), 83 + ed(1, 63) of the boundary go into 6, , 62, 63; respectively, and let 
y. (r, 0) be the function resulting out of y.(r, 8). F inally, let 3.(r, 6) be the poten- 
tial surface with the same boundary values as y-(r, 0). The boundary values of 


3<(", 8) lie monotonically on I’, and the three point condition is satisfied. A 
whe calculation yields” 


Diul = Dvtl = Div = 3 [[ {() + 3 (2%) rarag 
7 sti \(» ee a we) “e E (1 san} (1 + ede) dr dp 
= Diy] + = sf i (ut - — 1M) — ary) raed 
+5 [J iQetan) ee 


= Dy] + sf d(—2ry-Y) do + eI, 
r=l 








where | I | < 2M*D[y]. The single integral is to mean 


lim A(—2rp, Yo) dé. 


pl Yr=p 


But, for p > (1 + q)/2, / A(—2ry,-9e) dd = (—2ry,Ye)r—g S —4b. The 
r=p 6=y 


coefficient of ¢ in the expression for D[3.] is therefore S —2b. 
Now let » belong to $y. Choose ¢ positive but < 7, where 7 is the smaller 
1 b 
h 
of the two quantities 2M’ 2MEN’ Then 


:f A(—2ry,ye) dd + 2 S —2b+b= —b. 
rel = =— 
Hence 


D{ij.] S Diy] — be for OS € Sr. 


We may define a deformation of $y in itself in the following manner. If 
iy — q| 2 4, set f(y, t) = y; if|y — q| = 6 — of, 0 S o S 1, set f(y, t) = 
gort(7, 8). We have f(y, 0) = jo = y, and D{f(y, t)] = Dilger] S D[y] — dort. 
Replacing br by 8 completes the proof of the following variational condition. 

TuEorEM 5. Let q(r, 0) be a surface in B not a minimal surface. For any 
N > D{q], there are two positive constants 6 and 8 and a deformation f(y, t) of the 





#2 See Courant, l.c., note 2. 
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By + space Bw in itself, defined and continuous for all y in By and 0 S t < 1, with the 
d let following properties: 

ten- 1) f(y, 0) = 9; denote f(y, 1) by §. 

es of 2) fo, t) =n fly—g| 26. 

A 3) If|y—q| = 6 — 06,0 So S11, then D[f(y, t)] < Dy] — Bot. In particu- 


lar, D{§] = Dly] — Be. 
9. Proof of the First Main Theorem 


We shall now tie-together the various threads of our argument. The supposi- 
tion is that T bounds two minimal surfaces q’, q’’ which are proper relative 
minima. Furthermore, I is restricted to those boundary curves described in 
§3. By theorem 3, it is possible to connect q’ and q”’ by a closed connected set C 
for which the upper bound d[C] of D{q] for all q on C is finite. By theorem 1, 
there exists a minimum closed connected set C,, in $ containing q’ and q”, i.e., 

"0 one for which d[C,,] = d, the smallest possible. This minimum value d is larger 

€XNg than both D[q’] and D[q’’]. We shall now prove, using theorem 4 when q(r, @) 
is a minimal surface and theorem 5 when q(r, @) is not, that on C,, there is a 
minimal surface 3 for which D[3] = d. To apply theorem 4, the following known 
lemma is required. 

LemMA 10. A minimal surface q(r, 0) in $ cannot be constant on any arc of the 
boundary. 

Proor. Suppose that q(r, 0) = constant = 0 on an arc of the unit circle. 
The minimal surface can then be.extended by reflection across this are by setting 

q(r, 8) = —q(1/r, 6) ifr > 1. The arc then lies in the interior of the domain of 
he q(r, 6), so that 








ler Ge = 5a (E = G) onthe are. 


Since qe = 0, it follows that q, = 0, and the analytic vector function 
F(w) =e” («. — i2 a) = 0 on the are. 


Hence F(w) = 0 and q(r, 8) = constant. This contradiction proves the lemma. 

THEOREM 6. On the minimum closed connected set Cm containing q’ and q’’, 
there exists a minimal surface 3 for which D[3] = d. 

Proor. Suppose that no such minimal surface exists on C,,. Every surface 
q on C,, would then be either a minimal surface for which D[q] < d, or not a 
minimal surface at all. Theorems 4, 5 will yield deformations which will dimin- 
ish the value of d[C,,] in the neighborhood of each surface. By an application 
7 of the Heine-Borel-Lebesgue theorem, we shall then obtain a deformation of 
wf Cn into C;, for which d[C,,] < d, contrary to the minimum character of Cm. 
To apply theorem 5, let N be any number > d. 

For any q on C, construct in Bw the open 6-neighborhood, N(q), of q where 6 
is defined as follows: if q is not a minimal surface, it is the 6 of theorem 5; if q 
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Map the unit circle conformally on itself so that the points 6 + «A(1, 6,), 6 + 
eA(1, 62), 83 + ed(1, 63) of the boundary go into 6, 62, 63 respectively, and let 
y. (r, 8) be the function resulting out of y.(r, @). Finally, let 3.(r, 9) be the poten- 
tial surface with the same boundary values as y.(r, 6). The boundary values of 


3-(r, 8) lie monotonically on I’, and the three point condition is satisfied. A 
simple calculation yields” 


Dil S$ Dly?] = Diy] = = z | / (2) + =()) dr dp 
ye: aL 1(» ae oa ) +) T 2G aay athe + oe) dr dp 
= Diy] + ; [| . (v — 4 vi) - ary) rd 
+i [Joules re 


= Di) +§ | d(—2r, ty) do + 21, 
r= 





where | J | < 2M*D[y]. The single integral is to mean 


lim A(—2ry, 9) dé. 


pl Yr=p 


But, for p > (1 + q)/2, / \(—2ry-He) dé = (—2r0-M)rmg =< —4b. The 
r=p =y 


coefficient of « in the expression for D[j.] is therefore < —2b. 
Now let » belong to $y. Choose « positive but < 7, where r is the smaller 
1 b 
of the two quantities —— 2M’ 2MEN’ Then 


4 A(—2ry,y9) dd + 2 S —2b+6 = —b. 
r=] = 
Hence 


Dij.] S Diy] — be for OS € Sr. 


We may define a deformation of By in itself in the following manner. If 
ly 6 q | = é, set f(y, t) -” y; if | y p= q | = 6 — a6, 0 a2<c3 1, set f(y, ¢) a 
jen(r, 8). We have f(y, 0) = jo = y, and Dff(y, t)] = Disord) S D[v] — bert. 
Replacing br by 8 completes the proof of the following variational condition. 

THEOREM 5. Let q(r, 0) be a surface in B not a minimal surface. For any 
N > D{q], there are two positive constants 6 and 8 and a deformation f(y, t) of the 





#2 See Courant, l.c., note 2. 
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following properties: 

2) fo, ) =n ly—q| 26. 

3) If|y—q| = 4 — 08,0 S o S11, then D[f(y, t)] < D[y] — Bot. In particu- 
lar, D{h] = Dly] — Bo. 


9. Proof of the First Main Theorem 


We shall now tie:together the various threads of our argument. The supposi- 
tion is that T bounds two minimal surfaces q’, q’’ which are proper relative 
minima. Furthermore, I is restricted to those boundary curves described in 
§3. By theorem 3, it is possible to connect q’ and q”’ by a closed connected set C 
for which the upper bound d[C] of D[q] for all q on C is finite. By theorem 1, 
there exists a minimum closed connected set C,, in containing q’ and q”’, i.e., 
one for which d[C,,] = d, the smallest possible. This minimum value d is larger 
than both D[q’] and D[q’’]. We shall now prove, using theorem 4 when q(r, @) 
is a minimal surface and theorem 5 when q(r, @) is not, that on C,, there is a oa 
minimal surface 3 for which D[;] = d. To apply theorem 4, the following known 1 ae 









space Bw in itself, defined and continuous for all y in Pv and 0 S t S 1, with the | y ; 
F ’ - 
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Proor. Suppose that q(r, #2) = constant = 0 on an arc of the unit circle. 
The minimal surface can then be extended by reflection across this are by setting 
q(r, 6) = —q(1/r, 0) ifr > 1. The arc then lies in the interior of the domain of + | 
q(r, 0), so that 






lemma is required. i 
Lemma 10. A minimal surface q(r, 6) in $ cannot be constant on any arc of the ! ie 
boundary. y i 
el} 
i 












q: = 5a (FE =G) onthe are. 






Since gg = 0, it follows that q, = 0, and the analytic vector function 





F(w) =e” (a. _ iz «) = 0 on the are. 





Hence F(w) = 0 and q(r, 6) = constant. This contradiction proves the lemma. 

THEOREM 6. On the minimum closed connected set Cm containing q' and q”’, 
there exists a minimal surface 3 for which D[3] = d. 

Proor. Suppose that no such minimal surface exists on C,. Every surface 
gon C, would then be either a minimal surface for which D{q] < d, or not a 
minimal surface at all. Theorems 4, 5 will yield deformations which will dimin- 
ish the value of d[C'»] in the neighborhood of each surface. By an application 
of the Heine-Borel-Lebesgue theorem, we shall then obtain a deformation of 
Cn into C;, for which d[C,] < d, contrary to the minimum character of Cn . 
To apply theorem 5, let N be any number > d. 

For any q on C, construct in Bw the open 6-neighborhood, N(q), of q where 6 ' 
is defined as follows: if q is not a minimal surface, it is the 6 of theorem 5; if q 
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is a minimal surface for which D[{q] < d, it is the 6 of theorem 4 (or lemma 8) for 
n = 3(d — D{q]). The property 4) of theorem 4 then reads 


D{q] + d 


Dil 25 


or D{5] S Dly] — ca. 

These neighborhoods cover the closed compact set C,, , so that a finite number of 
them, N;,(q,), v = 1, 2, --- , n, suffice to cover C,, completely. Let 7, be the 
deformation of Bw given by theorem 4 or theorem 5 corresponding to q,. Let c 
be the maximum of }(D[{q,] + d) for ali those surfaces q, of q1, qe, --- , qn 
which are minimal surfaces, c < d. (If there are no minimal surfaces among 
41, +: , Qn, then in the following argument either eliminate c or take c = 0.) 
Then each deformation 7, either does not increase the value of the Dirichlet 
functional, or does not increase it above c. By the deformation 7,7, ... 
T,-1T'n, therefore, Cm is mapped into a closed connected set Cm+ for which 
AUC m+] S d. 

We assert that d[C,] < d. If not, there would be a sequence y;, Ye, --- 
belonging to C,,+ for which Diy; ]—-d. These surfaces originate from a sequence 
Yi, Ye, --- belonging to C,, which converge (by choosing a subsequence) to a 
surface y of Cn. Let Ns,(q.) be the first neighborhood which contains y, and let 
ly — qe | = 6, — 206, where 0 < 20 $1. Let B) Beg be the images of 
1, Ye, --- respectively under the deformation 772 --- T;,-.. Since y remains 
fixed under the deformation, we have 9; — 9; hence, for all sufficiently large 
i,|9: — ae| < 6 — od,. Apply the deformation 7; and let each such y; be 


transformed into y: . If q, is a minimal surface, either D{y;] < sg: lee Zc 


) 


or D[y;] < D[yi] — axe < d — ayo by theorem 4; if q is not a minimal surface, 
D{aq:] < D[vi] — Bio S d — Bo. In any case, letting c’ be the larger of the 
quantities c,d — a0 or c,d — Byo, we have 


D{y;] < ce’ < d. 


Finally, the deformation T44:7%42 --- 7, maps 9; into y; , and D{y;] < c’. 
But this contradicts the choice of y; , for which D[y;] > d. Hence we must have 
A[Cme] < d. 

Under the deformation 7,7, --- 7, the minimal surfaces q’, q’’ describe paths 
on which D[q] S c’’ < d, where c” is the largest of the three quantities D(q’), 
D{q’’),c. Adding these paths to Cm+ yields a closed connected set C,, containing 
q’, q’’ for which d[C,’] < d. But we must have d[Cn’] = d. Thisis a contradic- 
tion, and our original supposition is false. Theorem 6 is proved. 

The minimal surface 3 on C, for which D[;] = d cannot be a proper relative 
minimum, since there are surfaces of C, in any arbitrary neighborhood of 3, 
and for each surface y of Cn D[y] S d = Dj]. The proof of main theorem 1, 
stated in §4, is complete. 





*3 Applied in the order from left to right. 
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PLATEAU PROBLEM FOR NON-RELATIVE MINIMA 


10. The Morse Relations 


The methods, especially theorems 4, 5, which have been developed here to 
prove the first main theorem also serve to establish the Morse relations.” In 
the terminology of Morse, a k-cap in the space $ with cap limit a is a k- 
dimensional Vietoris chain” on 3, which is a cycle d-mod §, (this is read defi- 
nitely mod 3, and means mod $$, for some a’ < a), and which is not homologous to 
zero on $, d-mod ¥,. In particular, the k-cap can not be deformed on §&, to a 
point set d-on ,. It follows from theorems 4, 5,”° exactly as in the proof of 
theorem 6, that on each k-cap with cap limit a there is a minimal surface q for 
which D[q] = a. 

Let M; be a maximal group of k-caps with rank.” If each M,,k =0,1,---, 
has a finite dimension yu; , then the Morse relations state 


uo = Ro 


Mi — wo = Rh, — Ry 
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Hn — Mn-1 + --- + (—1)"uo = Ra — Raa + Raa — «+» + (—1)"Ro 












where Ry, Ri, ---,Rna, +--+ are the connectivity numbers of $. These con- 
nectivity numbers are understood in the sense that only chains which lie on By 
for some sufficiently large N (depending on the chain) are admitted. 

Decompose the set of all minimal surfaces q in f for which D[q] = constant = 
a into its maximal connected components (as subsets of $), and call each such 
component a bloc of minimal surfaces. Associated to any bloc o of minimal 
surfaces are k-caps with cap limit a which lie in an arbitrary neighborhood of o.” 
The dimension of a maximal group of k-caps with rank associated with o is 
called by Morse the k** type number of ¢. Because each k-cap with cap limit a 
contains a minimal surface q for which D[q] = a, it follows that 4, is the sum of 
the k*" type numbers of all blocs of minimal surfaces. 

It remains to compute the connectivity numbers of §. 

ToeoreM 7. Let Ry, Ri,---,Rn,--- be the connectivity numbers of $. 
Then Ro = 1,R, = Re = --- =~ R, =--- = 0. 





* References to the extensive literature on the Morse relations will be found in Morse; 
“The Calculus of Variations in the Large,’’ Amer. Math. Soc. Coll. Publ., vol. 18 and 
“Functional Topology etc.,”’ l.c., note 3. The procedure and terminology which we shall 
follow are contained in the references cited in note 3. 

* Vietoris, ‘“‘Uber den héheren Zusammenhang kompakten Raume und eine Klasse von 
zusammenhangenstreuen Abbildungen,’’ Math. Ann. vol. 97, 1927, pp. 454-472. See 
Morse, ‘‘Functional Topology etc.,’’ l.c., note 3. 

** Theorem 4 isa more general property than the concept of “reducibility’’ as introduced 
by Morse. 

*7 Cf. Morse, l.c., note 3. 
*® Cf. Morse, l.c., notes 3 and 24. 
















































H ee, Done 








854 MAX SHIFFMAN 


Proor. The first relation Ro = 1 is the content of theorem 3. Let z",n > 1, 
be any n-dimensional singular cycle in $y. Choose any surface q in By , and 
connect each surface y in 2” to q by the linear path y, of lemmas 4, 5. This 
defines a singular (n + 1)-chain whose boundary is z” and which lies on By. for 


. L’ 4 
Meinl fea Qn log sin? $ry’ 
by lemma 5. Hence, z” is homologous to zero, and R, = 0. It is clear what 
modifications to make in this proof if Vietoris chains are used in place of singular 
chains. 

We have therefore established, on the basis of the Morse theory, the 

Main TueoreM II. Let uz be the sum of the k type numbers of all blocs of 
minimal surfaces bounded by I (with the restrictions on T' stated in §3). Under the 


assumption that each u, ,k = 0,1, --- , is finite, the following Morse relations hold: 
mw 21 


M1 — wo 2 —1 


IV 


bin = baa + ++ + (=1)"uo = (=1)" 


Several problems remain in connection with this theorem. First, to show 
that a bloc of minimal surfaces bounded by I consists of one minimal surface. 
Second, to show that the k’th type number of a minimal surface bounded by [ 
is either zero for all k, or zero for all k ¥ j and 1 for k = j. Third, to char- 
acterize the Morse type of a minimal surface by properties similar to the number 
of conjugate points (or the number of negative characteristic roots in the 
associated problem) in the case of single integral problems in the calculus of 
variations.” 


New York UNIVERSITY. 





*9 Cf. Morse, l.c., notes 3 and 24, where similar questions are considered for single in- 
tegral problems. Also, with reference to minimal surfaces, cf. the classical work of 
Schwarz, collected works, vol. I, pp. 151-167, 223-269. 
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A THEOREM CONCERNING ANALYTIC CONTINUATION FOR 
FUNCTIONS OF SEVERAL COMPLEX VARIABLES 


By A. E. Tartor! 
(Received March 6, 1939) 
1. The Main Theorem 


The purpose of this note is to prove the following remarkable theorem per- 
taining to analytic functions of several complex variables. : 


EIEIO RT IS eye ae - = 
























‘ TozorEM. Let 7' be a bounded open set with connected boundary C, in the ii 
id: n-dimensional (real) Euclidean space of the n complex variables 2, --- , Xn 1 
(n>1). Let f(ai,-+-,%n) = f(x) be a single-valued function which is analytic Bt 
{meromorphic] in some region containing C. Then it is possible to extend f, by | 
analytic continuation, to a function which is single-valued and analytic |mero- | | 
morphic] throughout T + C (that is, in some open set containing T + C). Moe. 
The bracketed words afford an alternative reading of the theorem as a state- i 4 
ment about meromorphic functions.” f 
The theorem, when f is analytic, is due to Hartogs; for meromorphic func- {i | 4 
Ww tions it was enunciated by E. E. Levi.’ The following proof grew out of a study eis 
7 of the demonstrations of the theorem given by Osgood and by A. B. Brown.‘ | i 
r The method is materially different from that of either of these men, however. i bi 
= It is the belief of the author that the proof to be given, rather long though it is ui is 
r in detail, is fundamentally extremely simple and easy to grasp intuitively. 1 
e The detail seems necessary to avoid unjustifiably hasty conclusions. 






2. The First Fundamental Lemma 


The theorem to be proved stems from the following lemma. 

Lemma 1. Let P be a boundary point of a sphere K in the 2n-dimensional 
space under consideration (n > 1), and let f be a function which is analytic |mero- 
morphic] in that portion of a neighborhood of P which lies outside K. Then it is 
possible to continue f across the boundary of K in the neighborhood of P so that 
the extended function is analytic [meromorphic] at P. (Here again the bracketed 
words afford an alternative reading). 



















‘Part of the work on this paper was done while the author was a National Research 
Fellow at Princeton University. 

? For the definition of a meromorphic function of several variables see Osgood, Lehrbuch 
der Funktionen theorie, II, 1 (1929), pp. 180-183. 

‘The theorems, with references to the literature, are to be found in Osgood, loc. cit. 
Pp. 206 and p, 220. 

‘The paper by A. B. Brown is in the Duke Mathematica! journal, vol. 2 (1936) p. 20 ff. 

‘For the proof of this lemma, with the two readings, see Osgood, loc. cit. p. 204 and 
p. 215. 
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Lemma 1 and the main theorem are false when n = 1, so that they are dis- 
tinctly propositions about functions of several variables. 

The only uses which we shall make of the theory of analytic functions in the 
proof will be the appeal to Lemma 1 and to the fact that an analytic [mero- 
morphic] function is uniquely determined by its values in an arbitrary open set 
within its domain of definition. The rest of the argument is concerned with the 
nature of certain point sets. We shall first outline the general plan of the 
proof, and then proceed with the details. We shall deal throughout with the 
case that f is analytic. No modification of the proof is needed for the alter- 
native theorem, when f is meromorphic. 

Under the hypothesis f is analytic in a spherical neighborhood about each 
point of C, and we can select a finite number of these open spheres such that 
their point set sum S contains C. Clearly S is a bounded, connected, open 
set, and sois 7+ S. We introduced the notations 7) = 7+C,E=T+5S. 
We shall prove that f may be extended analytically to all of EZ. 

Choose a point Py not in the closure of # and consider spheres about this 
point as center. There will obviously exist such a sphere (call its bounding 
surface 2) with points of E outside 2», points of C on 2», and no point of T 
outside 2». For as the radius 79 of 2» we can take the maximum distance 
between Po and a variable point of C. The function f is the analytic in the 
portion of E outside 2». Now it is evident that the sphere 2» could be made 
a little smaller and f would continue to be defined and analytic in the part of E 
outside the sphere. If this shrinking process were continued, however, we 
should ultimately have points of E on the sphere at which f would not be defined. 
For any individual obstruction of this kind Lemma 1 provides us with a remedy: 
we can extend f across the sphere at this point provided that f is defined at all 
the nearby points outside. 

Two questions now arise: (1) at any stage in the process of shrinking the 
sphere will it always be possible to shrink it more, and still have an analytic 
continuation of f defined at all the points of F outside the new sphere? (2) May 
there not be some sphere to which we can approach arbitrarily near with our 
slightly larger spheres, but which itself can never be attained in the shrinking 
process? 

By setting forth a definite procedure for passing from one sphere to a smaller 
one, all the while maintaining an analytic continuation of f into an open set 
containing all the points of E outside the sphere, we shall deal with the diffi- 
culties suggested by the above questions, and show that they may be overcome. 


3. The Second Fundamental Lemma 


First we shall define the concept of an attainable sphere. 

Derinition: A sphere with surface = and center Po will be called attainable 
under the following circumstances. There exists a nonempty open set G outside =, 
together with a function ¢ single-valued and analytic in G. G and ¢ shall have 
the further properties: (1) G contains all points of E outside 2; (2) a boundary 
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point of G outside 2 shall belong to the set B of boundary points of S not in E: 
(3) The pownts of the boundary of G on & fall into several classes, as described 
below. We distinguish two of these classes by the letters D, R, and a part of D by F. 

(a) D includes =- To; 

(b) R is composed of all points of 2-(E — To) not in D; 

(c) D is closed, D + R is open relative’ to =; 

(d) the set F of points of D which are limit points of R is a subset of C; 

(e) All the points near a point of D + R and outside = are in G; 

(f) the boundary points of G on = but not in D + R are in B. 

(4) ¢ = f at all the points near a point of R and outside &. 

It may be easily verified that the sphere 2» defined in §2 is attainable if we 
take Gp to be the part of EF outside 2», g = fin G, Do = Yo-To, and Ry = 
(EZ — To). 

Lemma 2. If > is attainable and D is non-empty there exists a smaller attain- 
able sphere X’, the corresponding set G’ and function ¢’ standing in the relations 
G’ DG, ¢’ = ginG. 

Proor: By Lemma 1 and (3e) there exists, corresponding to each point P 
of D, a small sphere with center at P such that ¢ admits an analytic continua- 
tion into the interior of the sphere. Since D is closed and bounded we may 
take a uniform radius 7 > 0 for all such spheres. If r is the radius of = and 6 
the minimum distance between C and the boundary of S we may choose e 
positive, less than »/2, 6 and r, and define 2’ as the sphere with center at P» 
and radius r’ = r — e. Then 

I. All points of E between 2, 2’ and a distance = « from D are in S. For 
the radius from Py through such a point P could not meet = in a point of D 
(since r — r’ = e). If the point were in EF — S (and so in 7) the radius would 
either have to meet = in a point of 7) (which is impossible, by (3a)), or cross C 
between P and >. In the latter case P would be a distance < « < 6 from C, 
and hence in S. 

We now define G’. It shall consist of all points of G, D, R, together with all 
points between = and >’ which are either (a) a distance < ¢ from D, or (8) 
points of E a distance = ¢ from D (such points are in S, by J). We leave it to 
the reader to verify that G’ is open, non-empty, and that it contains all points 
of E outside >’. 

II. If P is a point on or inside = then either (i) the minimum distance from P 
to points of D is attained at a point of F, or (ii) P is inside 2 and the radius from 
P; through P meets = in a point of D such that all the nearby points on = are also 
in D. This is a quite general proposition to which we shall frequently refer. 
It depends rather obviously on the geometry of the sphere, together with 
property (3c) and the definition of F. We leave detailed verification to the 
reader. 


‘ That is, D + R is such that if P is in it all the points on = and sufficiently near to P 
are also in D + R. 
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III. A boundary point of G’ outside 2’ and on or inside = is a distance > 5 
from D, and is therefore in B. For if P is such a point its minimum distance 
from D is attained at a point of F (otherwise it would be in G’, by II and the 
definition of G’). Since F is a subset of C, by (3d), P would be in S if its dis- 
tance from F were less than 6. This is impossible. Now if P were inside > 
it would be the limit of points of G’ a distance > ¢ from D (since « < 4) and 
hence in S, by I. Thus P would be in B. If P were on = it would be the 
limit of points of G’ outside, on, or inside 2. In the latter two cases it would 
be in B by an argument similar to the one just given. In the first case it would 
be in B by (8f). 

Next we define yg’. For points of G we define y’ = g. For points of R and 
points of S between 2, 2’ and a distance 2 ¢ from D we put yg’ = f. Fora 
point of D or a point between 2, 2’, a distance < ¢ from D we define y’ by the 
analytic continuation of ¢ into the neighborhood of points of D, as already 
mentioned. Since e < 7/2 the definition of gy’ at the points of this last class is 
unique, by the uniqueness of analytic continuation. 

In verifying that ¢’ is analytic in G’ the only points at which special attention 
is required are (i) points of R a distance < ¢ from D, and (ii) points of R a dis- 
tance « from D, and points between 2, =’, a distance «from D. For points P 
of class (i) we observe that R is open relative to = and that y’ = f at all points 
near P on or outside 2 (by (4)), while ¢g’ is an analytic continuation of ¢ at the 
points near P inside 2. Hence gy’ = f in the full neighborhood of P. Fora 
point P of class (ii) the nearest point Q of D is in F (and so in C) by II, (3d). 
Since PQ = « < 4 it follows that the entire neighborhood of P is in S, as is 
also the interior of the sphere of radius e about Q. Then gy’ = f in this sphere, 
since there are points of R near Q (by definition of F). Thus gy’ = f in the 
neighborhood of P. This completes the proof that ¢’ is analytic in G’. 

To complete the proof of the lemma we must define D’, R’, and show that >’ 
has the properties (3), (4) of an attainable sphere. 

DerFiniTIon: R’ is the set of points of 2'.(E — To) for which y’ = f at the 
nearby points outside 2’. D’ is the set of boundary points of G’ on X’ which are 
either a distance « from D, but not in R’, or are in d’-Ty. 

A boundary point of G’ on 2’ not in D’ + R’ must be a distance > « from D, 
and hence in B, by J. It follows from this that R’ contains all points of 
2’.(E — To) not in D’. Thus we are left to verify properties 3c, d, e. To 
verify that D’ + R’ is open relative to >’ it suffices to consider a point P of D’, 
since clearly R’ is open relative to 2’. If P is an D’-T7> it is clear that all the 
nearby points on 2’ are in E, hence in D’ + R’. If P is a distance ¢ from D 
the line PoP meets 2 in a point Q of D. If Q is in F, P is in S (since F CC 
and e < 6). If Qisin D — F the points near Q on = are in D and hence the 
points near P on 2’ are in D’. Thus D’ + R’ is open relative to 2’. This 
argument also shows that all the points near a point of D’ + R’ and outside 
>’ are in G’. 

D’ is closed. For let {P,} be a sequence of points in D’, converging to a 
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limit P. If an infinite number of the P,, are in D’- Ty so is P, since 7) is closed. 
If an infinite number of the P,, are a distance ¢ from D, say P,Q, = ¢, Q, in D, 
we may assume that the Q, have a limit Q (in D, since it is closed). But then 
we easily see that PQ = «¢ and hence P is in D’, for it cannot be in R’, which is 
open relative to 2’. 

Finally, F’, the set of limit points of R’ in D’, is a subset of C. For let P,, > P, 
Pin D’, P,in R’. If P is in 2’-Ty it is necessarily in C because of the nature 
of R’. If this is not the case P is a distance ¢e from D, and the nearest point 
of D is in F (other-wise the points near P on 2’ would all be in D’, as we saw 
when we proved that D’ + R’ is open relative to 2’). Thus Pisin S. It can- 
not bein E — 7) without being in R’ because of property (4) and the fact that 
P,— P, P, in R’. Hence P must be in 7), and on C. This completes the 
verification of the fact that 2’ is an attainable sphere, and finishes the proof 
of the lemma. 


4. Proof of the Main Theorem 


We have seen that there exists an attainable sphere 2» with a non-empty 
set D). The process of Lemma 2 then leads us to a smaller attainable sphere 
>. If 2, intersects C the set D, is non-empty, and we can obtain a still smaller 
attainable sphere, and so on. Now it is clear that the theorem will be proved 
if we can prove the existence of an attainable sphere which is smaller than 2p , 
and which does not intersect C (the connectedness of C enters here). 

The repetition of the process of Lemma 2 is capable of providing us with such 
an attainable sphere. For if it were not there would exist a sequence of attain- 
able spheres Xo, 21, --- with radii 7 > 7 > ---, all the 2; intersecting C, 
and the r; having a limit r > 0 such that the sphere 2 of radius r and center 
P, would be non-attainable. Furthermore, each 2,4; and its associated Gj: , 
vis1, ete. would be related to those of its predecessor 2; in the manner set 
forth in the proof of Lemma 2. 

We shall prove that under the circumstances just described the sphere = 
is attainable, thus yielding a contradiction. We begin by defining G = 
G+G,+.---,¢=@inG;. Evidently ¢ is single-valued and analytic in G. 
We leave it for the reader to verify that © enjoys the properties (1), (2) of 
an attainable sphere (as defined in §3). It is then clear that a boundary point 
of G on = which is such that in every neighborhood of it there are points outside 
2 and not in G is a limit point of B; hence it is itself in B. We shall call such 
points points of class I. 

The remaining boundary points of G on 2, which we shall call points of 
class II, have the property that the nearby points outside = are all in G. 

Derinition. R is the totality of points of class Il which are in E — Ty and 
such that ¢ = f at the nearby points outside X. D is the remaining part of class II, 
except for limit points of R which are in B. 

The only properties of D, R not immediately obvious are the properties 3c, d. 
We turn our attention to these. 
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Since R is open relative to 2 we may, in proving that D + R is open relative 
to 2, consider points of D only. If P is such a point all the nearby points on > 
are in class II. They are then either in D + R, or are limit points of R in B. 
But points of the latter kind form a closed set, and if there were some of them 
in every neighborhood of P, P itself would be a point of this kind, contrary to 
its being in D. Thus D + R is open relative to 2. 

F (as defined in (3d)) is a subset of C. For let P bein F. Being a limit 
point of R, P is either in EH — Ty, on C, or a limit point of Rin B. The latter 
cannot be the case, since P isin D. If it were in E — T> it would be in R 
because being a limit point of R we could infer that g = f at the points near P 
outside 2. Hence P must be on C. 

Finally, D is closed. For let P be a limit point of D. P cannot be in R, 
since R is open relative to 2. Hence if P is not in D it must, by (8f), be in B, 
for the boundary of G is a closed set. Two cases arise, according as P is in 
class I or II. 

P 1n Cuass I: In this case P is the limit of a sequence of points of B lying on 
the boundary of G outside 2. Now S is composed of the sum of a finite number 
of open spheres. It follows from this that B is such that there is a whole arc 
of boundary points of G, outside 2 with P as an end point. In particular, it 
follows that we may choose points Q; along this arc, Q; between 2; and Li+;, 
for all sufficiently large values of 7, and such that Q; — P. Then Q; is a dis- 
tance = 6 from D;, and gi; = f at the points of Gi,; near Q;. This follows 
from II, §3, the fact that Q; is in B, and the way in which 9,4; , Gi4: are defined. 
We shall show that all the points of G in a sufficiently small neighborhood of P 
are in E — Ty, and that g = f at these points. To do this choose j so that 
r; — r < 6/4, and so that Q;, 7 2 j, exists, with Q;P < 6/4. Let K be the 
interior of the sphere of radius r; — r and center at P. Observe that r; — ris: = 
e, < 6/4if¢ = Jj. Fora fixed 7 = j a point of K-G;,; on or inside 2; is a dis- 
tance S 6/2 from Q;, which is a distance = 6 from D;. Hence the point in 
question is a distance = 6/2 > e¢;from D;. Itis then in S, and gis; = f there. 
This argument shows that all the points of G in K are in S, and that ¢ = f 
there. Since P is in B all these points must be in EH — 7). 

We now recall that P is a limit point of D, so that we have P, — P, P, in D. 
When n is sufficiently large P,, will be in K. Thus all the points near P, out- 
side 2, being in G, since P, is in class II, are in E — Ty), and g = f there. It 
then follows that the points near P on > are either in E — 7) or in B (they cannot 
be on C because of proximity to B). Now not all of them can be on B, because 
of the nature of B, which is composed of parts of the boundaries of the spheres 
which make up S. Hence P, is a point B which is a limit point of R. This 
contradicts the fact that it is in D. 

P in Cuass II: The only remaining possibility is that P is in class II. In 
this case it is a limit point of R as well as of D. Let K be a small neighborhood 
of P such that all the points of K outside = are in G. Then the intersection 
of K with a 2; must consist entirely of points of D; + R;. If K is sufficiently 
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small there can be no points of F; in the intersection, for P is a distance > 6 
from C. By taking K smaller, if necessary, we can insure that the intersection 
with 2; contains points of R;, for in any neighborhood of P there is a point 
of R, and g = f at the nearly points of G. But if a portion of the intersection 
is in R; it all is, as otherwise there would be a point of F; in it. 

Now not ali the points in K can be in E — 7), for if P, — P, P, in D, this 
would lead, as we have just seen a moment ago, to the conclusion that some 
P, isnot in D. There must therefore be a point Q of B in K and outside ». 
We know that Q is not on any 2;. Suppose it is between >; and 2,,;, and 
goin Gi4i. By I, §3 and the definition of G;,; its distance from D; is S$ ¢; = 
r; - Tisai. The nearest point of D; cannot be in F; , for we can assume K very 
small, and this would bring P too near C. Thus, by II, §3 the radius of 2; 
passing through Q meets 2; in D;, contrary to the fact that >; intersects K 
solely in R;. Thus we are again led to a contradiction. P is then necessarily 
in D, D is closed, = is an attainable sphere, and the proof of the theorem is 
complete. 
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AN INITIAL VALUE PROBLEM FOR ALL HYPERBOLIC PARTIAL 
DIFFERENTIAL EQUATIONS OF SECOND ORDER 
WITH THREE INDEPENDENT VARIABLES’ 


By Epwin W. Titt 
(Received December 30, 1938) 


This paper deals with an initial value problem that can be set for any analytic 
hyperbolic equation (rank 2 or 3) linear or non-linear in three independent 
variables with any inclination of the initial surface. This initial value problem 
is a Cauchy problem in which we require that the initial data be analytic in 
one of their arguments but not in the other. Special instances of this problem 
have been considered by Volterra” and Hadamard on the one hand and Hamel’ 
on the other. 

The (n — 1) dimensional characteristic strips employed in this paper are an 
extension on the one hand of a concept well known in the theory of the second 
order equation in two independent variables‘ and on the other hand of a concept 
which occurs in the theory of the first order equation in more than two inde- 
pendent variables and which has not been clearly recognized.’ 

The method used in handling our initial value problem in a great many 
places bears a resemblance to the method used by Lewy’ in showing that 
Cauchy’s problem for a non-linear hyperbolic equation in two independent 
variables has a unique solution. In the proof of our existence theorems we 





1 Developed in part while the author was a National Research Fellow and presented 
to the American Mathematical Society, Dec. 31, 1935. See Abstract 42-1-40, Bull. Amer. 
Math. Soc., vol. 42 (1936), p. 32. The present treatment incorporates many improvements 
and additions. The author wishes to take this opportunity to express his appreciation 
to Professor G. A. Bliss for his interest and encouragement when this work was in its 
early stages. 

2 See J. Hadamard, Lectures on Cauchy’s Problem in Linear Partial Differential Equa- 
tions, Yale Univ. Press, New Haven, (1923), p. 254. 

3G. Hamel, Dissertation, Gottingen, 1901. 

‘See Goursat, Lecons sur l’intégration des équations aux dérivées partielles du second 
ordre, Tome I, Hermann, Paris, (1896), pp. 171-4. 

5 See E. W. Titt, ‘(mn — 1)—Dimensional Characteristic Strips of a First Order Equa- 
tion and Cauchy’s Problem,’’ Duke Math. Jour., vol. 3 (1987), pp. 740-6. Cf. Courant u. 
Hilbert, Methoden der Mathematischen Physik, I1, Julius Springer, Berlin, (1937), pp. 82-7. 

6H. Lewy, ‘Uber das Anfangswertproblem bei einer hyperbolischen nichtlinearen 
partiellen Differentialgleichung zweiter Ordnung mit zwei unabhangigen Verinderlichen,” 
Math. Annalen, vol. 98 (1927-28), pp. 179-191. An exposition of Lewy’s work has been 
given by J. Hadamard, Le probleme de Cauchy et les équations aux dérivées partielles 
linéaires hyperboliques, Hermann, Paris, (1932), pp. 487-508. See also loc. cit., Courant 
u. Hilbert, Kap. V. 
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employ a combination of the method of successive approximations and the 
method of dominant functions. Although Lewy used a method of difference { 
equations in his original paper, his existence theorems have since been handled : 
by the method of successive approximations.’ Our use of dominant functions ih 
in the existence proof for linear equations resembles somewhat Hadamard’s* 
use of dominant functions in the proof of the convergence of the elementary b 
solution. 
For a detailed formulation of our problem and the plan of the paper the ‘ 
reader is referred to the first part of §2. | 
: 

t 









Before proceeding to the discussion of our problem we wish to state the 
definition of a partially analytic function and mention certain of its properties. iY 
A function f(x, y) of two real variables z and y will be said to be partially analytic 
with respect to y for y = yo in the interval a < x S 8 provided it can be repre- 
sented by a series of the form 










(i) f(x, y) = ao(x) + ay(x)(y — yo) + a2(x)(y — yo)? +---, 


whose coefficients are continuous functions of x in the interval a S$ x S 8 and } | 
; 






provided that the series (i) converges absolutely and uniformly fora < x & 8, | 
'y — yo| S y. Sometimes in what follows it will be convenient to refer to a i 
set of points satisfying inequalities of the type a S x S B,|y — y| S y as } 

f 







the region of partial analyticity. 
In particular we have that f(z, y) is a continuous function of z and y for 











aSx<B,|y — y| Sy and the integral i f(x'y) dx’ can be evaluated using 





term by term integration and the integrated series will again converge absolutely 
and uniformly fora S$ x S B,|y — yo| Sy. Due to the absolute and uniform 
convergence we are able to obtain a function, 







™m™ 


1_¥o 
Tr 







in which m and r are positive constants, which dominates the series (i) for all x 
which satisfy a < x S B. Due to the presence of this dominant function the 
series obtained from (i) using term by term differentiation with respect to y 
will again converge absolutely and uniformly in any subregion a S z & 8, 
'y — y| S$ 8 < y. Suppose that we have another function g(x, 2) par- 









™In this connection the author is deeply indebted to H. Lewy. In 1935 the author 
had the privilege of reading Lewy’s lectures on partial differential equations delivered 
at Gottingen in 1927-28 (unpublished). Substantially the same treatment of the existence 
theorems is given in Courant u. Hilbert, loc. cit., pp. 317-326. 

* See loc. cit., Hadamard (1923), Chap. III. For an exposition see T. Y. Thomas and 
E. Titt, “On the Elementary Solution of the General Linear Differential Equation of ; 
= Second Order with Analytic Coefficients,’ to appear in the Jour. de Math. Pures et He | 

Pp. 
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tially analytic with respect to z in a region a S$ x S B, |z—a@| Se. If 
| g(x, 20) — yo| < 7, then the result of replacing y in f(z, y) by g will be another 
function F(x, z) partially analytic with respect to z in some region a S x < 8, 
|z — z| S » < «. The extension to more variables is evident and other 
properties such as implicit function theorems will cause no difficulties. 


1. Cauchy’s problem in general 


Consider the partial differential equation 
(1.1) F(pap | Pal 2 | az") = 0 (2,8 = 1, 2, 3) 


in one unknown z and three independent variables x“, where the pe and pag 
denote the first and second partial derivatives respectively of z with respect 
to x“. Cauchy’s problem for the equation (1.1) is the problem of passing a 
solution through an initial strip 
(1.2) a* = §(u, w); z= S(u, w); 

Pa = Ta(U, W); Pap = Tap(U, W); 


satisfying (1.1) and subject to the strip conditions 


Zu = Pau } Zw = Poly j 
(1.3) ap , 

vi sig = B . Pa = B ° 

au Paptu ; aw Paptw ; 


where the subscripts wu and w denote partial cifferentiation with respect to 
u and w. 

Let us inquire into the conditions under which the equation (1.1) will deter- 
mine the third derivatives pag, over the initial strip. Differentiating the equa- 
tion (1.1) with respect to x” considering all of its arguments as functions of x*, 
we obtain 








(1.4) aD apy +b, = 0, 
where 
a“ = pall 2a = ¥ (a ¥ 8), 
ODaa dDas 
oF oF aF 
b, = — — —, 
oe ae Day + a2 Dy + ax? 
In addition to (1.4) the derivatives pag, must satisfy the conditions 
ODa ) a 
(1.5) ily Papy Lu; “pet = Papy Xe - 


Keeping y fixed the matrix of the coefficients of the derivatives pag, in the 
equations (1.4) and (1.5) has the form 


(1. 


Th 


wh 
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The six rowed determinant in the upper left hand corner of (1.6) has the value 


—xia™D.Dsz, 





where 






2 3 3.2 Se . 48 
Dy = Luly — Luly ; De = Luly — Lukw, 


SR ee 
= =e * 
ne 


Under the assumption that the initial strip (1.2) satisfies the condition 
a”D.Ds ¥ 0, we can solve the first six equations of (1.4) and (1.5) for the six 
derivatives Papy (y fixed). There is no loss of generality in assuming in par- 
ticular that 2, ¥ 0. 

Now let us consider the characteristic case, i.e. the case in which the strip ae a 
(1.2) satisfies the equation 4) i 


(1.7) 






2. > feet 








a”D.Ds = 0. 





The six rowed determinant in the lower left hand corner of (1.6) vanishes 
identically. For if we multiply the first row by x}, the second by —z1,, etc., 
and add, we find that the rows are linearly dependent. Hence, the equation 
(1.7) implies that the rank of the matrix, formed by taking the first six columns 
of (1.6), is less than six. Let us assume, for example, that the determinant 













am O02z 0 0 
zs 0 zm 0 0 
(1.8) 0 2 O x O|} = 23 D,D. ¥ 0. 
0m 0 x O 
0 O xr, 
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Then if the equations (1.4) and (1.5) are to be consistent, we have six conditions 
which must be satisfied, namely 


22 13 23 33 
re se 2. € b, 


x 0 x 0 0 — Pry 
ou 


de tt ee 59 


ow 
u u au 


et 0 2&0 US 
ow 


0 0 1 2 3 OPsy 
™ aw 








and 


(1.10) ae “Per _ go Per og, 
du ow 

A strip (1.2) which satisfies (1.7) and conditions of the type (1.8), (1.9), and 
(1.10) will be called a two-dimensional characteristic strip. If we know a solution, 
z = 2*(x*), of the equation (1.1) then the quantities a“ are known functions 
of x; let us denote them by a**’. A surface 2* = 7%(u, w) which satisfies 
the equation 


(1.11) a**“D,Ds = 0 
will be called a characteristic surface for the solution z2*. If the non-parametric 
equation of the surface is ®(2’, x”, z°) = 0 then (1.11) becomes 


(1.12) a* bb, = 0. 


2. The initial value problem for / = 0 


Thus far we have been dealing with Cauchy’s problem in general. In this 
section we shall formulate the Cauchy problem which will interest us in the 
present paper, i.e. we shall formulate conditions to be imposed upon the func- 
tion F and upon the initial strip (1.2). 

We propose the following initial value problem for the equation (1.1). Let 
the function F and the initial strip 


(2.1) «2° =#%(s,w); 2=2s,w); pa = pal8,W); Pas = Pap(S, W); 


satisfy the following conditions: 
(a) The functions (2.1) and their first derivatives with respect to s shall be 


part 
equé 


(2.2 


(kt 
bork 


whe 
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will 
dev 
met 
lem 
and 
var 
pro 
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ma 
of 7} 
cer 
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an: 
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an: 
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tially analytic with respect tow. The strip functions (2.1) shall satisfy the 
equation (1.1) and the strip conditions 


2. = Pal; , Zw = Pakw , 
(2.2) ODa cae ie OPa — - 
as. =r Dash. , ow = Pasty - 


(b) The function F shall be an analytic function of its arguments in a neigh- 
borhood of the set of values *, Z, pa, Pas - 


(c) Along the strip (2.1) the following conditions shall be satisfied: 
(2.3) aA Ag ¥ 0, 


Aaprex’, < 0, 





(2.4) 


where the A, are analogous to the D, with u replaced by s; the Ag denote the 
cofactors of a** in the determinant a = | a“ |. 

The problem of determining a solution of (1.1) containing the strip (2.1) 
will be termed the initial value problem for F = 0. The remainder of §2 is 
devoted to interpreting the conditions (2.3) and (2.4) algebraically and geo- 
metrically in order that our problem can be compared to other Cauchy prob- 
lems. In the first part of §3 we introduce quantities %.. for future reference 
and then discuss the invariance of our problem under changes of independent 
variable z* and changes of parameter s and w in (2.1). We then reduce our 
problem to a canonical form termed the initial value problem for F = 0. In §4 
under the assumption that we have a solution z* of the initial value problem 
for F = 0, we map the space of codrdinates x* onto a space of codrdinates wu, », 
win such a manner that two one-parameter family characteristic surfaces for 2* 
map into the planes u = constant and v = constant. We then form a system 
of partial differential equations 2 with independent variables u, v, w by selecting 
certain of the characteristic conditions in §1 which must be satisfied over u = 
constant and v = constant. In §5, with the aid of a uniqueness theorem in §6, 
we show that a partially analytic solution of the induced initial value problem 
for 2 is also a solution of the initial value problem for F = 0. In §9, with 
the aid of an existence theorem in §8, we show the existence of a partially 
analytic solution of this induced initial value problem for 2. Also in §9, with 
the aid of a uniqueness theorem in §8, we show the uniqueness of a solution 2* 
of the initial value problem for F = 0 for which the mapping in §¢ is partially 
analytic. In §7 we point out certain simplifications in case F = 0 is linear 
and obtain a slightly different uniqueness result. 

We shall now examine the significance of the above conditions. The condi- 
tion (2.3) insures that the surface S: x* = #*(s, w) shall be non-singular and the 
rank of the matrix || a\|| must be two or three, if the condition (2.4) is to be 
satisfied. 
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Before proceeding further we observe an identity that will be of use to us 
several times. If we apply Lagrange’s identity 
(2.5) (a X b)-(c X d) = (a-c)(b-d) — (a-d)(b-c), 


(written in the notation of vector analysis) four times, we get 

















a”°Y.Ys a” YL = Pt. £3 he Y; ea 
(2.6) |aZaYs a”ZaZzp a“Z_ a"Za||Zs Zs 
= (Aak*) R'+ ... = AgsR*P, 
where 


R'=Y¥.2;-Y:22, R?=Y¥:2i—YVi23, R= Vide — Yo, 


and similarly for R*.° 

The condition (2.4) implies that the form (2.3) must be indefinite (semi- 
indefinite) for if the form (2.3) were definite (semi-definite) then the adjoint 
form (2.4) would be positive definite (positive semi-definite) and (2.4) could 
not be satisfied by a real vector x. The case of rank three is covered by a 
discussion of Dickson” and we give here a proof following the same lines which 
covers the case of rank two. 

Let us assume that the form y = a“Y.,Ys is positive semi-definite of rank 
two, ie. Y 2 O for real Y,. We can without loss of generality assume that 
a" ¥ 0, for if all the a** were zero then at least one of the a” (a ¥ 8) would 
be different from zero and ¥ would take on both positive and negative values. 
Since ¥(1, 0, 0) = a”, we have a’ > 0. By taking Ye = Ya, Za = Za, 
Z, = 1, Z. = Z; = 0 in (2.6) we have 


(2.7) a'W(¥1; Y2, Ys) = L’? + Ass(Y2)” — 2AosV2¥s + Ano(Ys)’, 


where L = a“Y,. One of the quantities As, Ass, let us say Ass ¥ 0, for 
if both were zero then the principle minors obtained by bordering a” with one 
row and one column, and also with two rows and two columns would vanish 
and || a|| would be of rank one. Then take Y; = —a”, Y2 = a", Y3 = 0 
in (2.7) and it follows that A3; > 0. Multiply (2.7) by Ass and use | a|-= 0, 
we get 


a Awy(Y1 »¥2, V3) = Agl’ + (AssYe — AasY3)°. 


TuroreM I. In a positive semi-definite form (rank 2) by a change of sub- 
script we can make a’, As3 positive. If a" and Ags are positive then a form (rank 2) 
ts positive semi-definite. In a negative semi-definite form replace the above by 
a” < 0 and Az; > 0. 





* The frequent use of Lagrange’s identity in what follows is the outgrowth of a remark 
by Dr. Tobias Dantzig. 

10 See L. E. Dickson, Studies in the Theory of Numbers, University of Chicago Press 
(1930), pp. 1-10. 
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The form YW = AggR*R? is of rank one if y is of rank two. Hence, if we 
apply (2.7) to ¥, we see that the adjoint of a semi-definite form (rank 2) is a 
positive semi-definite form (rank 1). We sum up the above remarks by saying 
that (2.4) amplies that the equation (1.1) is hyperbolic for the initial data (2.1). 
Let us for the present think of 


(2.8) a“, = 0 
as a conic in homogeneous point coordinates ©, and think of 
(2.9) LyPe = 0 


as a line. 
THEOREM II. The inequality (2.4) is a necessary and sufficient condition that 


the line (2.9) cut the conic (2.8) in two real and distinct points ®, . 
To show that the condition is necessary let Y, and Z, be two real and dis- 
tinct points on the line (2.9) which do not lie on the conic (2.8). Then the 


equation 
(2.10) a®(Ya + dZa)(Vs + AZs) = a YiVs + 2a” YZ, + Xa ZZ, = 0 


must have two real and distinct roots 4. Since the R* are proportional to the 
z, it then follows from (2.6) that the condition (2.4) is necessary. In order to 
be able to reverse the argument there must exist two points on the line but 
not on the conic. If every point on the line (2.9) satisfies (2.8) then pick two 
such points Y, and Z,. Equation (2.10) must be satisfied for all values of 4, 
ie. the left member of (2.6) must vanish. Hence, the left member of (2.4) 
vanishes also, contrary to hypothesis. 

We divide the discussion into two cases according to the rank of the 
matrix || a ||. 

Rank 3: The equation (2.8) can be regarded as the tangential equation of 
the characteristic cone having the corresponding point P of the surface S as its 
vertex." Condition (2.3) then implies that the tangent plane to S at P is not 
tangent to the characteristic cone with verter at P. Also (2.9) can be regarded 
as the tangential equation of the vector x; which is a vector at P. From the 
above result on conics we see that the condition (2.4) can be regarded as re- 
quiring that through the vector z{ at P there pass two real planes tangent 
to the characteristic cone with vertex at P. In other words (2.4) implies that 
the vector x% at P lies outside the characteristic cone with vertex at P. 

Rank 2: In this case the characteristic cone becomes a pair of real inter- 
secting lines through P. Condition (2.3) then implies that the tangent plane to S 
at P does not contain either of the lines which constitute the characteristic cone 
with verter at P. The condition (2.4) implies that these same lines together 
with the vector x% at P determine two distinct planes. In other words (2.4) 
implies that the vector x% at P does not lie in the plane of the characteristic cone 
with vertex at P. 


"Cf. Loc. cit. Hadamard (1923), p. 21. 
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We give three examples which indicate the scope of the initial value problem a” 
for F = 0. al 
cay = put pe — ps = 0, a“Awds = — 1, 3 
S:2=0, 2 =s8s, r =v, Aasxext, = —1, A 
= pu + po — pss = 0, a“AAs = 1, 
(By |” 3 2 a B T 
pe x=8, x =w, Aapluly = —1, : 
4 | 
ci" = pu — po t+ ps = 0, aA As = 1, it 
S:2=0, 2=s, =u, Aasters, = —1. T 
In example (A) the characteristic cone with vertex at a point of S is in 
(z’ — 2)? + (2? — 23)? — (2°)? = 0 and the vector x2 ~ 0, 1, 0 lies outside (3 
this cone. We note that in (A) s and w could be interchanged. In (B) the p 
characteristic cone is (2')’ + (x° — x9)” — (a° — 2)” = 0 and x2 ~ 0, 1, 0. 
Here z' and 2’ could be interchanged but not sand w. In (C) the characteristic v: 
cone consists of (x')? — (2° — 2)? = 0, x* = 2), a pair of lines. The vector n 
x» ~ 0, 0, 1 is perpendicular to the plane of these lines. In (C) the surface S 
could have been 2’ = s, x” = 0, 2° = w. ( 
3. The initial value problem for * = 0 dl 
0 
We return to the consideration of the conic (2.8) and the line (2.9) together 
with the line 
(3.1) Q°S, = I 
The condition that the lines (2.9) and (3.1) intersect in a point on the conic 
(2.8) is given by 
(3.2) a” YY, = AeQ*Q’ = 0, 1 
where 
Yi=Q@n-Qn, Yr=Oxn-Qn, Y:= Qs — Qn 
a 


The quantities Ys, are defined by (3.2). Recalling the manner in which we 
obtained (1.7) from (1.6) we get the following explicit formulae which occur 
often in our work 


22 23 33 
a@ a a oe 
2 3 1 3 
2 3 1 3 
a, =— 0 ya 


A simpler method for obtaining %; for present purposes is to put te 1, 
Q’ = Q’ = 0 in (3.2). Then Y; = 0, ¥2 = 2», Ys = tp and Un = a” *Y Ys. 
If we put Z, = ae, 2 = 0, Z; = —zi, and W, = —z.,, We: = Le ; W; = 0 ! 
then Noo = @ 07 alp and P) Ps = a “WW . Also Mre = a” Y Zs, Mis -” 
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a®Y.W, and %s = a”Z.Ws. Then R' = 2izl, R? = xiz?, R® = (x3) 
and (2.6) yields 
(3.3) %11%22 — (Wie)? = (22) A apr er’ . 
A similar calculation yields 
—(An% ss — Ars%ie) = xea2A apxer’ . 

These relations together with four analogous ones show that the adjoint to 
|| Mog || is of rank one. Hence, || %ag || is of rank two. From (3.3) and (2.4) 
it follows that each principal minor of | %.s | is negative or zero. Hence from 
Theorem I it follows that the form (3.2) quadratic in the Q*, is semi-indefinite. 

TueorREM III. The form (3.2) factors into two real and distinct forms linear 
in the Q*. Geometrically this means that the family of lines (3.1) satisfying 
(3.2), i.e. the family of lines through the intersection of (2.8) and (2.9) is com- 
posed of two distinct real pencils. 

Let us investigate the effect of a change of codrdinates x“ upon our boundary 
value problem for F = 0. If we require that F transform as a scalar under the 
non-singular analytic transformation 


a 


(3.4) #° = £°(2', 2’, x), 


then the quantities a“ must constitute the components of a contravariant tensor 
of second order, i.e. 


at’ = a” ax" az 














ax? aa°° 
It follows immediately that @ = a| a%/dz |’ and it is easily seen that” 
ax” dx’ | az |” 
pe dig Ae hy 
po age aR | ax 
The quantities z$ are the components of a contravariant vector, i.e. 
wail g Ox" 
Ly = wan rR? 
Tw aap 
and making use of (2.5) we have 
0% = OF" az az 
ws = Ls => ae ne 2 3 
i ax8 ax? oz? Oa*| |%s Xe 
1= = . 
yt oy ae'| fant as'| [at at 
” axe ~™ aah ax? aa? 
dx’ | 0% ax" 4 
= eee Ree ooo Aa — | — 
1 agi | ax | * az | ax 














2 A proof that will hold even in case || a || is of rank two can be modeled along the 
lines of L. P. Eisenhart, Riemannian Geometry, Princeton University Press (1926), p. 14. 
A direct proof, similar to that given below for the law of transformation of the A, , could 
be given. 
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or 
m ax” | dz | 
Bie = a Pv . 
* az | az | 
In the above we have made use of the fact that the quantities ax°/d2' satisfy 
the system 
av” ax" sg 
nga c= * lene 63 ° 
dx* Oz! 
Hence the laws of transformation for the quantities appearing in (2.3) and (2.4) 
are given by 





~apR ap Oz 

. AJA = A.A | ce , 
(3.5) a 3 — 
Oz 
Aap, = Angee, | 2 
pl wt. B ar 








Thus the conditions (2.3) and (2.4) are invariant under non-singular transforma- 
tions (3.4). 
Equation (3.5) can be written in the form 


P| 2 
RR 


me | 2 
Wack 28 - Nagrs x ie | - 
Make use of the law of transformation for the vector x¥ and the quotient law 
of tensors, we get 

az 12 
Oz | 


ax’ ax’ 


Mas = Un os 








In view of (2.3) and (2.4) at least one of the quantities %., and at least one of 
the quantities z;,; must be different from zero at each point of the strip (2.1). 
Then choose the transformation (3.4) or more particularly the derivatives 
dx*/dz' so that %1, + 0 and the derivatives dx*/d%" so that Yo ~ 0. If neces- 
sary alter this choice to make the determinant 


ax’ aa 1 








az age 7” 
x” aa” 2 
ag! ag? 7 
ax* a2° 3 


different from zero. The value of 7%, obtained by solving the system of three 
equations 


aa* ~B 
axe ty = Te 


will be different from zero. 
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Suppose that change of variable x* of the above type has been made; we will 
now make a change of parameter that will make # = w. Since # = 0, the 
change of parameter 


(3.6) g= 58, 


has an inverse of the form 
s = 3, 
w = w(8, w), 
where w is partially analytic with respect to w. From (3.6) we have 


aa" _ ae" | ax" a a" _ aa” a 
as a3 am as’ aw aw dw’ 





and it is easily seen that each of the conditions (2.3), (2.4), %u ~ 0, Uo ¥ 0, 
remain unchanged under the above change of parameter. 

After the change of variable (3.4) followed by the change of parameter 
(3.6) has been made we shall refer to our initial value problem as the problem 
for F = 0. The initial value problem for F = 0 satisfies in addition to condi- 
tions (a), (b), (c) the conditions 
(3.7) # =», 

(3.8)  Wn(ws)? + Wiowirs + Yeo(zi)’ ¥ 0, 
(3.9)  %n%xe — (Yi)? < 0, 


(3.10) Au ~ 0, Yoo ¥ 0. 


(c’) 





4, The initial value problem for 2 


In this section we assume that we have a solution of our boundary value 
problem for F = 0, namely a function z = 2z*(z’, 2’, x°) which satisfies (1.1) 
and contains the strip (2.1). If we look for characteristic surfaces for z* in 
the form (2*) = x’ — ¢(z’, z*) = 0 then the partial differential equation (1.12) 
becomes 


(4.1) Om a*™(,)" a 20** ous ea a**(y,)? “eal 2a" 0s Ri 2a*"*y5 + a*" = 0. 


Cauchy’s method of integrating the equation (4.1) employs the one-dimensional 
characteristic strips which are solutions of the system of ordinary equations 


dx* a dx on 
dv dg.’ = dv" Oy’ 
(4.2) dy d¢ y re) 
2. B . (a = 2, 3). 
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Then take a solution of the system (4.2) depending on five arbitrary constants 
Xo , Yao , namely 


Qa 


x 


Il 


x*(v | ro | Pad); 
(4.3) ? 
Palv | Zo | a0); 


Ya 


which reduces to xo , ¢gao for vy = 0. 

The Cauchy problem which will interest us is to pass a solution of (4.1) 
through each of the curves C” (s = constant) lying on the surface S. The 
initial strips are obtained by solving the two equations (4.1) and 


(4.4) —2), + rigs + rigs = 0 


for g and 3 for each value of s. Take account of (3.7), multiply (4.1) by 
(x2,)’, use (4.4); one obtains 


(4.5) Ai(v2)” + WWiyve + ee = 0. 


On account of (3.9) and (3.10) the two solutions go(s, w), Ze(s, w) of (4.5) will 
be real and distinct. Thus when we replace the arbitrary constants in (4.3) 
by £°(s, w), Za(s, w); €°(s, w), Za(s, w), (s = constant) we obtain two surfaces 
satisfying (4.1) through each of the curves C”. 

If instead of replacing the arbitrary constants in (4.3) by functions of w 
alone, we replace them by the functions #*(s, w), ga(s, w), also by #*(s, w), 
Za(s, w) we obtain two sets of functions 


(4.6) (a) z* = X“(v, 8, w), (b) z* = Z°(v, 8, w) 


each of which gives a mapping of the portion of the x* space in the neighborhood 
of S upon the space with codrdinates v, s, w. We have merely to show that 
the Jacobian of (4.6) is different from zero at each point of S. On account of 
(4.2) at a point of S (v = 0) the Jacobian of (4.6a) is 


\2a"°Y.53 &: £, 


| 








(4.7) 2a" Y.03 2 a| = 2a" YZ, ¥ 0, 
2a" Y,83 a ad 
where Y; = —1, Ya = @3;Za = Aa. By (4.1) Y, lies on the conic (2.8) and 


by (2.3) Za lies off the same conic. In view of (4.4) both Y,. and Z, lie on 
the line (2.9). Hence (4.7) must be satisfied or (2.10) would have two roots 
\ = 0 and (2.9) would be tangent to (2.8). 

For v = 0, as a function of s and w, the determinant 


|X, X; X;| (2a Ys 2a% 83 2a” Yds 
2a°Z 53 2a%Z.53 2a"°Z.ds |, 


my my my niga 


(4.8) 
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eee 


where Za ~ —1, $2, 9s. If we write R' = ads — ah, RK = & —&, R* = 
d — %, then a account of identities of the type (2.6) the quantity (4.8) be- 
comes 4A aght® #%,. But by (4.4) the R* are proportional to the #2 and hence by 
(2.4) the quantity (4.8) is different from zero over S (v = 0). 

Consider the change of variable 


eS ae 
> Anat - 
ee 






=», 


p= y, v 
(4.9) (a) §=<8, (b) S=<s, 
® = X°*(v, s, w), wb = Z(v, 8, w). 


Since (4.9) is the identity transformation for vy = 0, the initial value problem 
for F = 0 will not be affected by this change of parameter. For the same 
reason the set of transformed functions (4.6), namely 


(a) 2 = X'(,3,0), (b) 2 = 2'6,3,0), (¢ = 1,2) 





RES ae ee ~_ 
A — 





(4.10) Bort ee 


3 
’ t= Uv, 


r= 


Sl 






= 





|| 
will reduce to “(s, w) if we put » = 0 or § = O as the case may be, and put i 
s=§=3,w = © = w. From the form of (4.9) it is easily seen that the sur- i | ‘a 
faces obtained by putting § = constant or § = constant in (4.10) are char- } | eae 
acteristic. On account of the nonvanishing of (4.8) and the form of (4.9) it i le 

a 

if 


follows that ford = > = 0,8 = § = 3,w = BH = W®, the quantity 







oa cae i koe 
5 5 O wt 
~ ae , lf . 
(4.11) Be E 0|#0. Mis 
RL xt x bi I 
® ® w . an 4 
Let us identify w, 0, #, put § = bi: u, § = —+/2» and consider I \ 
(4.12) RX, V2u, w) — EG, —V20, w) = (i = 1, 2) a | 








as equations for > and # in terms of u,v, w. For u + v = 0 the equations (4.12) 
possess the solution * = » = 0, and by (4.11) the Jacobian of the left members 
of (4.12) with respect to %, > is different from zero for u + v = 0. Hence, 
(4.12) possesses a unique solution > = 7(u, v, w), § = H(u, v, w) which reduces 
tov => = Oforu+v=0. Differentiating (4.12) we find that the Jacobian 


a (rs mm) 
“ae 8) §) 

















A080) _ _ jg? _ _» 


a(uvw) 






- #0, 
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foru + v= 0. Hence, from either of the mappings (4.10) we obtain 
(4.13) x’ = a'(u, », w), 2 =w (¢ = 1, 2), 
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which maps the portion of x* space in the neighborhood of S upon the space of 
coérdinates u,v, w. If we regard the u, v, w space as Euclidean with rectangular 
Cartesian coérdinates then the planes w = constant map into the surfaces 
z*® = constant. The plane wu + v = O maps into S, ice. (4.13) reduces to 
&°(s, w) if we put /2u = —V/2v = 8s. The planes u = constant and v = 
constant map into characteristic surfaces. The variable s measures distance 
along the lines u + v = 0, w = constant. 

Upon replacing the z* in the z* and its derivatives by means of equations 
(4.13) we obtain a set of functions 


(4.14) a%(u,v,w); z(u,v, w); Palu,%,W); Dap (u, v, w) 


which reduces to (2.1) for u + v = 0. A repetition of the argument used in 
§1 shows that the functions (4.14) for uw = constant constitute a two-dimen- 
sional characteristic strip; similarly for v = constant. In fact the functions 
(4.14) must satisfy the equations 


(wi)? + Worx, + Ae(rz) = 0, 
Ai(xd)? + Warorra? + Aso(xz)” = O. 


In view of (3.9) and (3.10) the two roots p and o of the quadratic equation 
(4.15) are real, distinct and different from zero in the neighborhood of 
S(u + v = 0). Since in (4.13) xia? — xi.2} ¥ 0, the equations (4.15) can be 
replaced by the first two equations of the system (4.16). The remaining 
equations of (4.16), which we shall refer to as the system 2, are some of the 
conditions which express the fact that (4.14) for « = constant or v = constant 
constitute a two-dimensional characteristic strip. 


(4.15) 


2 1 
Ly — prt, = 0, 


(a) <2; —oxi =0, 


a= 0, 
Zu = Paki; 
(b) ODa ‘ 
Bu = Papru ; 


(4.16) 
(c) | te Pry + Au of a, + E,xi, = 0, 
ou du 








a ODaa dee ODaa on — 
@) < ap 2s = 9 (a = 2,3), 
(e) o op + b) Fr ope | x, - Ex? as @. 
ov Ov 


The equations (4.16c) are obtained by expanding (1.9) and making use of 
(4.16a). Thus the quantities H, depend on a, b, , dpa,/dw, x2. Conversely, 
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fz, ¥ 0 the equations (4.16c) and (4.16a) imply (1.9), and we shall find in 
computation that it is often more convenient to use the form (1.9), rather 
than (4.16c). 

We shall refer to the problem of finding a solution (4.14) of the system = 
which reduces to (2.1) for u + v = 0 as the initial value problem for =. Evi- 
dently a solution of the initial value problem for F = 0 is also a solution of 
the initial value problem for 2. 


5. Equivalence 


This section will be devoted to showing that a solution (4.14) of the initial 
value problem for 2, which is partially analytic with respect to w, is also a 
solution of the initial value problem for F = 0. Todo this we must show that: 

(1) The variables x* can be introduced as independent variables in place of 


U, Vv, W. 
(2) The functions (4.14) satisfy (1.1). 
wy Pe xe oe _ 

axe P@)  Braqgp Pat 


First consider the one equation z*, = 0 with initial conditions # = w. Evi- 
dently the function z* = w satisfies the above equation and initial conditions. 
The uniqueness theorem in §6 shows that the solution is unique. Thus the 
function x*(u, v, w) is independent of u and v. 

In order to show that, in (4.14), we can introduce the variables x“ as inde- 
pendent variables in place of u, v, w we must show that the Jacobian 


za, t, 0 
a x 0|4¥0 
Be Be Xe 
over the initial plane uw + v = 0. The quantity 


Dy (Culy — LyXy) = TylyFy(o — p) 


is different from zero provided we can show that x, and zx, do not vanish ini- 
tially. Suppose that 2}, does vanish over u + v = 0; then from the form of 
(4.16a) it follows that x; also vanishes. Since s is the arc length along the 
line u + v = 0, w = constant, and u = (1/+/2)s and v = (—1/+/2)s, we get 


= (% -%) Fe (i = 1,2). 
as ou w)/J/2 ~~Jf2 a 


Hence 


ax” 9 = (= 72) = 9 
as ds = of 2 \ av av , 


and in view of (3.8) our assumption that z\, vanishes leads to a contradiction. 
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Similarly zx} must be different from zero over u + v = 0. 
Let us adopt the notation 


U=2— pets, V=2&— pale, W =2u — Deke, 


Ge au = Paplu ’ ¥e a aw Paptw . 
From the differential equations (4.16b) we have U = U, =0. Making use of 
the fact that the solution (4.14) of the system 2 has continuous cross deriva- 


tives, we have 





OW. “f OU _ OPap 8 _ OPap p 
m5) = "a OR 
Consequently the equations (4.16d) imply that 

OWe _ ow; 
“= Feat 


In consequence of condition (c’) the ratio x?:2) is different from 22:2) initially: 
Hence the determinant 


zr x 0 

a 2s 0/#0 
over u + v = 0, and it follows from (2.2) and (4.16b) that az/dz* = p,, 
dp./dx° = Pag over u + v = 0. In other words 
(5.3) V=V.=W=W.=0 over u+v=0, 


which together with (5.2) implies that W. = W; = 0. 

Multiply the last columns of the determinants (1.9) by x2 respectively and 
add the resulting equations. Then in the determinant in the left member of 
the resulting equation, multiply the first five columns by certain factors and 
add to the last column and we obtain 





a a? Qa" a g® yy a” “Pee + b,xt 
2 0 0 0 o ~ me 
ou 
20 2.0. 0 .<2acS 
ou ou = 0, 
ot 2 » 0 ~Sag 
ou 
. 2650, @ & Opis Ly 
ou 
0 O Lew x x, 0 





> 4A @®d zt 
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lacking. Expanding the above determinant we have 








du 


i 
where >,’ denotes the sum indicated except that the term containing a” is : 
1 P2a"DaDp + totate {(2e)*(are2 ee 2)+ (2a"*xy — a2) | = 0, 

\ 
{ 

























which becomes i 
> of aw a || 
va- (5.4) vy ry ty \ (te) Fu + (272% — a® 2h wii = 0, t 
' \ ou ] 
when account is taken of (1.7) or (4.16a), and (4.16b). i! 
In a similar manner we obtain from (1.9) the equation f i 
3 Opi a” 1/,.3\2 [ a6 9Pap aw, 4 
Cw aw D «Ds + Tote {28@%) (« my + b,22) — Us sul = 0. pe ; 
; When account is taken of (1.7) and W. = W; = 0 this becomes i 4 
y: Rt ; 
\ =! 1 /,3\2 i fi | 
(5.5) oy 22 —— — ry, (ry) (FY, - Fy.Wi ‘ad FW) - it if 
He 
It is easily seen that / | 
(5.6) | 
when use is made of Wz. = W; = 0. The system of linear partial differential i | 
equations (5.4), (5.5) and (5.6) in the unknowns F, W, W, has an identically Ni 
vanishing solution. In view of condition (a) and (5.3) and the uniqueness i ka 
theorem of §6 we must have F = W = W, = 0. 1 fe 
d Multiply the first of equations (4.16c) by x3 and the equation (4.16e) by 2’, ih ie 
: and subtract and we have upon making use of (4.16a) the equation Ih 









Use Opu . Win Apri2 Ase opu Yu Opie 


—i—— 9 


x, eu 2 ou ty Ov 2 ov 










Introducing x“ as the independent variable this becomes 


Opu ( Zu Pe (= a =) 
" me =)- oat Gt) 


From equations (4.16a) we have 








11 
Yee Lu Ly 
aot: aes 
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and since 2,25 — x22} 0, it follows that 
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Analogous to (5.1) we have 


OV. _ ODag J . ODasg wf 


we ee 


which together with (5.7) implies that 8V,/@u = 0. Thus we have V; = 0 
We have yet to show the vanishing of V, V2, Vs. Analogous to (5.6) we have 


(5.8) Tn OF abe: 
ou 


It is easily verified that 2{aV./du = 2x.0V./dw when account is taken of 
U,. = W. = 0. Since V; = 0 we can then write 


2? af 3 aVs 2? wd, 3 oo 


(5.9) + 2% ee ee ee +z 
ou 


Combining equations (1.9) in a third manner we obtain 


3 Opr2 a”D. D 3\2_2 ap ODag :) a dV2 
tw a, gt Ly ty 4 (ty) tu (a = + b,x? aa 


oe | (2a — gz? hl + gz! #4 oe 
ow 
Making use of (1.7), F = 0, and V; = 0 this becomes 


2 zt Su, ett | (20 3 _ g®,2) + at 3 38 aVs 


\ aw 
(5.10) 
+ (FV + Fp,V2 + F,Vs) al | ae 


Equations (5.8), (5.9), and (5.10) together with (5.3) imply that V = V2 = 
V3 = 0. 


6. An existence theorem for linear partial differential equations 


Consider a system of n partial differential equations of the form 


=1,°::,n 


a We a a 
(a) af = b; ou o + Fy + di, a By 
*4j=1,°°+,p 


a,b=pt+l1,:--,n 


in which the y. are to be determined as functions of u,v, w. Here the variables 
u, v, 8, w have the same relation to Euclidean space with rectangular Cartesian 
coérdinates as in §§4, 5 but otherwise the notation used in this section bears 
no relation to the rest of the paper. 

We propose the following initial value problem. We have given over the 
plane u + v = 0, the functions y{(s, w) which are partially analytic with 


(6.1) 


(b) de He = bs ote + coYa + da, | 
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respect to w for w = wo in an interval G*. The as , bs , cs , da , das /du, Bas /dv 
shall be functions of the three variables u, v, w, partially analytic with respect 
to w for w = Wo in a two dimensional region G containing G* in its interior. 
We can just as well assume that G* consists of all points of u + v = 0 in G. 
The determinant | ag | shall be different from zero in its region of definition. 
Under these assumptions we shall show that there exists a unique solution 
ya(u, v, w) of (6.1), partially analytic with respect to ’ for w = wp in a sub- 
region G; of G, which takes on the given initial values y{”(s, w) over u + v = 0. 
As a first step we make the change of unknown 
















(6.2) Zp = GBYa, 





and the system (6.1) takes the form 








ae gg Be 
Pe + gfzatd;, 





(6.3) 





o 






= fe S24 git + de. 







We adopt the notation 










za (8, W) = days (8, Ww) 








for the transformed initial values. ie 
We employ a method of successive approximations and define the v‘* approxi- j If 
mating function 2, by the equations sat 





v—l 


2; = 2f(—v, v, w) + f pr Oa + gf2n7 + a} (u’, v, w) du’, 












(6.4) i I 
= 2 (u, —u, w) +f ise Me den + giz," + da} (uy v’, w) do’. ‘| 

By z{”(—v, v, w) we mean the assigned initial value z2{”(s, w) at the point on at 

the plane u + v = 0 with codrdinates —v, v, w. The variables u’ and v’ which yi 





appear as arguments in the integrand are variables of integration. To complete 
our definition of 2, we take 










0 (0) 
a= 2 (—v, Vv, Ww), 
(6.5) ( ) 






ze = 20 (u, —u, w). 





The z’,(u, v, w) defined by (6.4) and (6.5) will be partially analytic with respect 
to w for w = woin the region G. In order to be able to perform the integrations 
indicated in (6.4) we must restrict G to be of such a shape that if u, v lies in G 
so does the entire triangle with vertices (u, v), (u, —u), (—, »). 
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For convenience we take w) = 0 and choose positive constants m and r so 
that 


filet. 
ang) 
r 
(0) 


dominates each of the coefficients fz , g3 , da and z, for each pair of values 
u,vin G. Let f(w) be any function analytic in the neighborhood of w = 0 
such that 


(6.6) 


M 


where < means “dominated by.” Then it follows that 


(6.7) fw) « a 


h+1° 
r(1 — *) 
r 
Let us note that 


(6.8) iLqad+Se-£5. 


p=1 


fw) « 


From (6.4), (6.6), and (6.7) we have 





0 Re ee 49] where Ms = m(1+ nm + 2”), 
w 
a 
In turn we get 
(6.9) o- t'<« Fa eer (u = 1,2, -++), 
w 
(7) 
where 
M, = wetes (> SA 1) (u = 2,3, +++). 
im r 
Put 





Limit (M+) oe nak 


po M,-1 i 


Select a positive constant k < rand let G, denote a subregion of G which satisfies 
the inequality 


(6.10) wietvlo sey 


28 


Sine 


initi 
thar 


for 
fun 


ver 
wit 


pls 
co 


tw 
Co 


S 












les 


883 





INITIAL VALUE PROBLEM FOR HYPERBOLIC EQUATIONS 


since |u + v| is proportional to the distance from (u, v, w) to the plane 
y + » = 0 this inequality requires that we remain sufficiently close to the 
initial plane. In view of (6.9) each term of (6.8) will be in absolute value less 
than the corresponding term of the convergent series of positive constants 


(4 [m+ Za) 


for values of u, v, in G, and values of w satisfying | w| S k. 

Hence, the function z,(u, v, w) converges uniformly to a continuous limit 
function za(u, v, w). Furthermore making use of the theorem on absolute con- 
vergence of double arrays the limit function z,(u, v, w) will be partially analytic 
with respect to w for w = win the region G;. Let » — @ in (6.4) we obtain 


ow 2PT. : a Oa . ! 
ea (a) w~z=a ( oon) + [Ase + gtee + ah dw, 


(b) za = 25" (u, —u, w) + [ {fe a + ge 2a + i) dv’. 
The functions z.(u, v, w) satisfy the initial conditions. Differentiating (6.11a) 
with respect to u and (6.11b) with respect to v we see that the system (6.3) is 
satisfied. 

Consider the uniqueness of the solution. Let z{? and z{” be two such solu- 
tions of the system (6.3) which take on the same initial values 2!” over the 


planeu + v= 0. We will show that in a neighborhood of u + v = 0,2? = z®. 
The functions 2°” and 2°? must both satisfy the system (6.11). Select the 
constants m and r so that (6.6) dominates the coefficients of (6.3) and also the 
two solutions 2°? and z. Replace the za. in (6.11) by z{” and 22”, subtract 


corresponding equations and obtain in succession 


M 
er |u + |? (u = 1,2, ---), 


r 
where 


Ms = 2nmt(1 +1), u, = “Mes(—1 41) (u = 2,3, ---). 


1 2 
tn — ta <K 





Select G, as before and obtain 


yw 

ja — of] < M5) (u = 1,2,---). 
(-;) 

(2) 


In the limit we obtain 2? = 2” for u and v in G, and | w| S k. 
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7. Initial value problem for a linear equation F = 0 


In case (1.1) is the linear differential equation 
(7.1) F =a”pas + cp, + dz +e =0, 


where the coefficients a*°, c’, d, e are functions of x* alone, the analysis of §6 
can be used to show both the existence and in a sense the uniqueness of a solu- 
tion of the initial value problem for F = 0. 

In the case of the linear equation (7.1) the foregoing theory can be con- 
siderably simplified. The pag can be dropped from the strip (1.2) and the 
corresponding strip conditions (1.3) can be omitted. Then the differentiation 
(1.4) is unnecessary and the last column of (1.6) is replaced by b, —a@p,/du, 
—dp,/dw, —dp2/du, —dp./dw, —dp3/du, —Ap;/dw, where b = c’p, + dz + e. 
The equations (1.9) and (1.10) are correspondingly reduced from six to two 
equations. The characteristic surfaces of (7.1) are determined independently 
of a solution z* and let us examine §4. Since the a” are analytic in their argu- 
ments z*, the functions in (4.3) can also be taken to be analytic in their argu- 
ments. When one inspects the changes of variables used in obtaining (4.13) 
it is seen that the functions x* in (4.13) are partially analytic with respect to w. 
Since the mapping (4.13) can be obtained independently of a solution z* of 
F = 0 we exclude the equations (4.16a) from the system 2. The system = 
is replaced by the system >’ containing four equations 


Zu = Dal, 
25 (te p se *P) + 2. E =0, 
ou ou 


(2’) 
a Pa _ a OPa _ 


w —_—s 


au “ ow : 
x, (ta Ph + Wh = + 23E = 0, 
ov Ov 


where E is linear in dp,/dw, pa, 2. Remembering that the z* are known 
functions of u, v, w it is seen that 2’ is linear in the z, p. , and their derivatives 
with respect to u, v, w with coefficients partially analytic with respect to w. 
In fact the system 2’ is of the type considered in §6 when we note that the 
determinant 


px.» An O 
(7.2) Le te te sed Lv (Moe Wr1)?(o ma p). 
o Moo Ma 0 


The non-vanishing of (7.2) over u + v = 0 follows immediately from condition 
(c’) since p and o must be different and both different from zero. 








valu 
argl 
a 80 
tion 
par 








of §6 
solu- 


con- 
the 
tion 
/du, 
+ e. 
two 
tly 
gu- 
gu- 
13) 


of 


n 





INITIAL VALUE PROBLEM FOR HYPERBOLIC EQUATIONS 885 





















By the existence theorem in §6 there exists a unique solution of the initial 
value problem for 2’, which is partially analytic with respect to w. By an 
argument’ similar to that in §5 it can be shown that this solution of >’ is also 

a solution of the initial value problem for F = 0. Conversely, consider a solu- 
tion of the initial value problem for F = 0 which furnishes a solution of >’ 
partially analytic with respect to w when the substitution (4.13) has been made. 
By the uniqueness theorem of §6 this solution of 2’ is unique. 


8. An existence theorem for non-linear partial differential equations 


Consider a system of n partial differential equations of the form 


(a) &; = a; ove _ i = 0, a,B,y, =1,-+-+,n 
at igloo 

b any a Wa ey" 

(b) 6G ir, oe a,b=p+1, 


in which the yg are to be determined as functions of u, v, w. We have given 
over the plane u + v = 0 the initial values y{(s, w). The functions y{(s, w) 
and their first derivatives with respect to s shall be partially analytic with 
respect to w for w = w in an interval G*. The coefficients as and b, shall be 
analytic functions of their arguments y, and dy,/dw in a neighborhood of the 
set of initial values y” and ayS”/aw. The determinant | as | shall be different 
from zero for the given initial values. We shall show that there exists a unique 
solution ya(u, v, w) of (8.1) which together with its u, v, and wv derivatives is 
partially analytic with respect to w for w = wo in a region G2 and which takes 
on the given initial values over u + v = 0. 

First we differentiate (8.1a) with respect to v and (8.1b) with respect to u, 
regarding the y, as functions of u, v, w, and solve the resulting system to obtain 


(8.2) O°Ya _ f F Ya FYa Wa Wa Wa ). 
auav *~\duaw’ dvdw’ du’ av’ aw’ ”* 


Before integrating the equations (8.2) we shall need to know the initial values 
dye’ /du and ayS?/av. We have 


ys? 
(8.3) V2 We tS 











(0) (0) 
— Ya a 


8 aussi’ 








8 It is to be noted that the argument of §5 requires that the solution of 2’ have continu- 
ous u, v, and uv derivatives. The existence of these derivatives could have been insured 
by considering the system obtained from (6.1) by differentiation with respect to u and v 
as is done in §8. However, we thought it not advisable to include this complication in 
our first existence theorem. 

It might also be noted at this point that each of the systems in §5 to which the unique- 
ness theorem of §6 is applied is of the type (6.la) with (6.1b) vacuous. For such a system 
the transformation (6.2) is unnecessary and we need not require that the coefficients aj 
in (6.1) have u and v derivatives. 
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since s = \/2u = —»/2v. In addition the initial values dy!” /du and ay” /av 
must satisfy (8.1) and making use of | as | + 0 we see that the determinant of 
the coefficients of these 2 n equations, which is of the form 























is different from zero. We designate the initial values thus obtained by 
y2(s, w) and y2(s, w). 
Again we employ a method of successive approximations 


yt 


(a) Yar ‘= ys. (—2, v, w) +[ ThFow, You: Ves Yo Yo, y’) du’, 


(8.4) 
(b) You = ys ra (Ub, —u, w) + [ SalYuw, You Yu; Yr Yrs y ”) dv’ ’ 


where for convenience the subscript a is omitted in the integrand but otherwise 


the notation follows §6. Out of these 2n equations we can calculate the func- 


tions y’., and y’,,' providing the corresponding functions of the superscript » 


remain within the region of analyticity of f.. We first show that from (8.4) 
we get a unique function y’,". If 


yar = y2(—», v, w) + r You (u', v, w) du’, 
(8.5) 


Ga = ys (u, —u, w) + [ Yav (u,v, w) dv’, 
we shall show that y’,"? = 9%". Substituting from (8.4) we have 


ye = yD (—, », w) +f yo (u’, —u’, w) du’ 


+ [ i ft pitty y’) ww du! 


(8.6) 


But from (8.3) we have 


ya (—v, », w) +f youu’, —w’, w) du’ = ys (—v, v, w) + [ uw, —u',w) 


+ Vf2 yu’, —u! ,)} du’ = y’(u, —u, w) + [ yo (—0', v', w) dv’, 


where 
repea 


are e 
t, w) 
funct 


(8.7) 
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(8.5 
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where the last integral is the result of a change of variable v’ = —u’. The 
repeated integral in (8.6) and 


[. 3 falyuw, **,¥') an} dv’ 


are each equal to the integral over the triangle with vertices (u, —u, w), (—», 


1, w), and (u, v, w). Hence y" = gi". We fix the zeroth approximating 


functions by 
(8.7) Ye = ya (—2, v; w); You and youl, —U, w); You — yor (—2, v, w). 
We employ the method of dominant functions in order to show that the 
process (8.4) and (8.5) can be continued indefinitely. First for convenience we 
take w = 0 in our hypothesis. By limiting ourselves to values of u and »v for 
which | u + » | is sufficiently small, the function y4(u, v, w) and its derivatives 
will remain within the region of analyticity of f,. Furthermore by (8.4) and 
(8.5) the function y%,(u, v, w) reduces to y.(s, w) for u + v = 0 and its deriva- 
tives yxu(u, v, w) and yro(u, v, w) to yS2(s, w) and y{)(s, w) for u + v = 0. 
Hence if we make the change of unknown 





(8.8) Za = Ya — Ya(U, v, w), 
we obtain in place of (8.2) the equations 
82a 
(8.9) a = Fa(Zuw; Zewy Suy Zvy Swy 2, U, V, w) 


with the initial conditions z{?(s, w) = 0, 20(s, w) = 0, 2{(s, w) = 0. The 


function F, is partially analytic with respect to all its arguments (except u 
and v) for the zero values of these arguments in a region G of the space of coérdi- 
nates u and v. Let us select positive constants M and R so that for values 


of u and v in G the right member of 

Za =— M 

dudv 1 n(Zuw + Low + Lu + Zy + Zw t Z) + Ww 
sy R 


will dominate the right member of (8.9). Take o = u + v and look for Z as a 
function of « and w; then (8.10) becomes 


(8.10) 








WZa _ M 


ac? : n(2Zew + 2Z,+ Zn + Z) +w 
Ke R 


with initial conditions Z, = 0 and 0Z,/a0 = 0 for s = 0. By the Cauchy 
Kowalewski theorem (8.11) has a solution Z,(c, w) which has a convergent 
power series expansion about (0, 0) and which together with its derivative 
0Z,/d0 reduces to zero for o = 0. It is evident that the coefficients of this 


(8.11) 
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power series expansion are positive. Hence, when we replace o by u + »v we 
obtain a solution Z,.(u + v, w) of (8.10) which has a power series expansion in 
u + v and w with positive coefficients convergent, say for |u + v| < y, 
|w| < 6 and which together with its u and v derivatives reduces to zero for 
u-+v=0. Let us choose a subregion G; of G which satisfies |u + v| < y. 
Also, the Z, and their derivatives satisfy the integral equations 


x. M 
Ze = d Fe 
* Lan eee 


R 











3 M ’ 
(8.12) (b) Zou Pe i n(Zuw + ves + Z) + Ww 


R 





Zaulu’, v; w) du’. 


wae | 


(ec) Za 


Let us suppose that the method of successive approximations has been carried 
out on the system (8.9) and we shall refer to the equations corresponding to 
(8.4), (8.5), etc., as (8.4*), (8.5*), etc. We now assume that 

(a) Each of the functions 2,(u, v, w); Zau(u, ¥, W); Zav(u, v, w) is partially 
analytic with respect to w for w = 0 for values of u and »v in G,. 

(8) The relations 


Zau(|u+v|, w) > Zou(u, v, w), 
Zev (|u+v|, w) > zeo(u, v, w), 


are valid in G, . 
We shall show (a) and (8) are valid with »v replaced by vy + 1. Let us use 


M 
z Tas 
1 MZ + ++ + Z) + > u(| wu + v |) w” 


R 


to designate the result of replacing the Z, and their derivatives in the power series 
expansion for the right member of (8.10) by the expansions for Za(| u + v |, w), 
etc., in powers of w. Similarly let us use 


(8.14) F.(Zuw wor z, U, V, w) = Le Bau, v)w* 
u 


(8.13) 








to designate the result of replacing the z, , --- , Zu», in the expansion for F. , 
by their expansions in powers of w. Due to property (8) and the fact that 
the right member of (8.10) dominates the right member of (8.9) the series in 
the right member of (8.14) will converge for wand vin G,. In fact we can write 


(8.15) » A, (| w+ v |) w* >> DS Bay(u, v) w" 


for u 
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ruandvinG,. Put A,(|u+v|) = ¥ Cy |u+yv| where C,, > 0. From 


(8.15) we get 
A+1 
: (8.16) if Bay(u, v’) do’ s|[ A,(| u + 0’ Da] = Dont , 
A+ 1 

From (8.12b) it follows that the extreme right member of (8.16) is the coeffi- 
cient of w“ in the expansion for Z..(|u + |, w). From (8.4b*) it follows that 
the quantity within the absolute sign in the extreme left member of (8.16) is 
the coefficient of w“ in the expansion of z’..’. In view of the above argument it is 
easily seen that properties (a) and (8) hold for v replaced by v + 1 so far as 
the function zu is concerned. Similar arguments can be given for the func- 
tions 2, and Ze». From the fact that (8.7*) are identically zero and the fact 
that any power series with positive coefficients dominates the function zero, it 
follows that (#) and (8) hold for y = 0. Hence, the process (8.4*) and (8.5*) 
can be continued indefinitely. 

We have yet to show that the functions 2, , Zau, Zav tend uniformly to con- 
tinuous limit functions. Employing the mean value theorem we get from (8.4*) 
and (8.5*) 


uf y y 
ay — Lay = [ ee (chu — Zbuw) tree + Pac _ haw, 











(8.17) ta — Sen = [ i * (Zbuw — Zeuw) + > i zs - x} - 


we —-% = (ert? — Zou) du’. 


If we let ¢, denote the set of arguments Zauw,--- , 2% in the function F, then 
each of the coefficients 8F'/af, in (8.17) is evaluated at 0% + O¢%* where 0 
and 6. are positive constants satisfying 6: + 0: = 1. Since the functions 
Zaw »°** »Z@ are dominated by Zauw(|u + |, w),---,Za(|u+v|, w) respec- 
tively for values of u and v in G; , we can obtain one function 


(8.18) ee aa 


ee 
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oF 
ae 


of u and v; m and r are positive constants. From (8.4*) and (8.18) we get 


which dominates F.(¢? | u, v, w), — (0:¢% + O¢% "| u, v, w) for the same values 


Zar KX |u+v|, 
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and in turn (8.5*) yields 





m |u+ol 
po Re 
r 


(8.19) za K 
For convenience in calculation we suppose that in addition to the conditions so 
far imposed on | u + v | that | w+ | < 1 and (8.19) can be replaced by 


7 |u + v|. 
boa 
“ 


In a similar manner we obtain from (8.17) 


ae... ae 





(dae — a0) K a \na lu + of, 
es, 
(tt. — ett) «x > utoP, = W=1,2,3,--.) 
WwW i 
G-7) 
G- &” «K so lu+o|", 
=F) 
where 
a 3nmM ,-1 Qu = @ M, =m, 
M, = b ( r +1), (u = 2,3, ---). 


If we let m’ = 6nm/r and let Gz denote a subregion of G; which satisfies in 
addition to (6.10) the inequality | w+ v| < 1, we obtain limit functions 2., Zau, 
Za, partially analytic with respect to w for w = 0 for values of u and v in Gr, 
which satisfy the equations 


Zen = 1 FalSuw; --> » 2 %, 0, OW)’, 
(8.20) San = Fa(Zuw ,°+* > U, 0, W) du’, 


u 
Ze -[ Zay du’. 
v 


Differentiating (8.20) we see that the functions z,(u, v, w) possess continuous 
derivatives z.., partially analytic with respect to w for w = 0 for u and v in G2. 
The uniqueness of the solution of the system (8.20) is established in the usual 
manner. 

The transformation (8.8) yields the set of functions ya(u, v, w) which we set 
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out to find. For these functions y.(u, v, w) satisfy (8.2) and the initial condi- 
tions. If we let ®.(u, v, w) denote the result of substituting the functions 
ya(u, ¥, W) into the left members of (8.1), it follows from (8.2) that 

0®; a®, 

=< SS 0 Kilian, GR z 
(8.21) 7 in @ 


The system (8.21) with initial conditions $, = 0 for u + v = 0 has by the 
uniqueness theorem in §6 the unique solution @, = 0. 


9. Initial value problem for a non-linear equation / = 0 


The analysis of §8 can be used to show the existence and in a sense the unique- 
ness of a solution for the initial value problem for F = 0. 
We note that the system 2 is of the type considered in §8 since the determinant 


prlee Ma 0 0 0 0 
0 prlee Mi 0 0 0 


0 0 0 ple 
rm 7 = (M11 Moe x3,)?(p? = po) ~ 0 


0 n 2 0 Le 
Se eS 2 
Ma Te 8 2 0 O| 


overu+v=0. By the existence theorem in §8 there exists a unique solution 
of the initial value problem for 2 which is partially analytic with respect to w. 
By the argument in §5 this solution of ~ is also a solution of the initial value 
problem for F = 0. Conversely, any solution of the initial value problem for 
F = 0 which admits a mapping (4.14) partially analytic with respect to w 
is unique. 
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THE RIEMANNIAN AND AFFINE DIFFERENTIAL GEOMETRY OF 
PRODUCT-SPACES 


By F. A. FickENn 
(Received August 19, 1938) 


A Riemannian geometry is completely determined by defining over a space a 
quadratic differential form ds” = gdzx"dz’, called the metric form. Let {P} 
and {Q} denote two Riemannian spaces, of dimensions p and q, with metric 
forms whose coefficients are gq and g:;. Then the product {P} X {Q} is a well- 
defined space {R} of dimension r = p+ q. A metric may be assigned to {R} 
at will, but, in order that the geometry of {R} may be accessible through the 
geometries of {P} and {Q}, a metric is suggested which depends on the given 
metrics of {P} and {Q}. If the metric of {R} has coefficients 


ga» O 
Jap = ’ 
0 gi 


then the geometry produced on {R} may be inferred in great detail from the 
given geometries of {P} and {Q}. To examine this inference is the main purpose 
of the present paper. 

A geometric object S in a product-space may be called a product-object (§2) 


if, symbolically, 
(* 0 
S= , 
0 Ss 


where S; and Sz are similar objects on the factor spaces. Thus gag is defined 
above to be a product-tensor, and T3,, Rsys, etc. are product-objects. A 
product-space has families of fundamental subspaces, each of which is a copy of 
one of the factor-spaces. Any subspace of a product-space has projections (§4) 
on these fundamental subspaces. In §§4-7 product-objects and projections are 
used to exhibit the relation between the geometry of our product space and that 
of its factors with regard to geodesic subspaces, parallel displacement, curvature, 
parallel fields of vector-spaces, and motions. These results are extended in §8 
to the product of two affinely connected spaces. An indication is given in §9 of 
the type of difficulty which arises in connection with other definitions of gas - 

It is an unsolved problem to give useful necessary and sufficient conditions 
that a Riemann space be a product-space. Little is known of product-spaces in 
the large, or of products of spaces with an indefinite metric. None of these 
problems is treated in this paper. 
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RIEMANNIAN AND AFFINE DIFFERENTIAL GEOMETRY 


All functions are assumed to have differentiability properties adequate to the 
part they play in the discussion. In a general way, notation and terminology 
jollow L. P. Eisenhart’s texts.’ 

The author is indebted to Professor A. W. Tucker for a suggestion which led 
to the study of this problem. Professors L. P. Eisenhart and M. 8. Knebelman 
have kindly examined the manuscript, and have offered many very valuable 
suggestions. For this generous and stimulating counsel the author eagerly 
extends his warm thanks. 


I. Intropuctory REMARKS 


1. Let {P} and {Q} be two topological spaces. We form the set {R} of 
ordered pairs R = (P, Q), where P is a point in {P} and Q a point in {Q}, and 
agree that the point R shall be “‘near’’ the point R’ if and only if P is “near’’ P’ 
and Q “near” Q’. Then {R} is a topological space which is called the product of 
(P} and {Q}; we write {R} = {P} X {Q}. Examples: With any of the usual 
meanings of ‘‘nearness,” the product of two straight lines is a plane, the product 
of two circles is a torus, and the product of a 2-sphere and a line segment is a 
spherical shell. 

If {P} and {Q} are manifolds of dimensions p and gq and classes u and »v, then 
{|R} is a manifold of dimension r = p + gq and of class w equal to the lesser of 
wand v. 

Except in §8, the present discussion is conducted on the hypothesis that {P} 
and {Q} are Riemann spaces ®, and ®,. Indices are assigned the following 
permanent ranges: a = 1,---,rj;a@,a=1,---,pjanda,i =1,---,qor 
p+ 1,---,7 according to context. a’ and a are indices of the first kind, a” 
and 7 of the second. It is usually convenient to use Greek indices in {R} and 
Latin indices in R, and R, . 

If P< 2x" and Q < z' are codrdinate systems for ®, and R,, then R <> x* 
is 2 coérdinate system for {R} which will be described as the product of the two 
given codrdinate systems. Such a codrdinate system will be called a code, and 
codes will be used exclusively. Every transformation of coérdinates (change of 
code) in {R} is understood, therefore, to be of the form 


-al Fie | 
=f" (x ,--+,2”) 
-a2 po f° (a?", fot , 2’). 


Equations z* = c' define subspaces of {R} which are topologically equivalent 
to the second ae tg ®, ; any such subspace will be denoted by R,. The 


equations £* + t* set up a regular one-one correspondence of class v 
between the Ry given by z* = c* and the R, given by z* =c* + ‘*. The 
equations «* = c* give an analogous system of subspaces R, . 


A manifold such as {R} is (locally) the topological product of factor-manifolds 


‘Riemannian Geometry, cited as R.G., and Continuous Groups of Transformations, 
cited as C.G. 
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in several different ways for each coérdinate system (not necessarily a code, of 
course). For the present discussion, it is desired to define a positive definite 
metric tensor gag of class w — 1 over {R} in such a way that the resulting Rie- 
mann space R, may justly be called a metric product of R, and R, . 

It seems natural to require that the geometry assigned by gag to R, shall 
induce on each subspace K, (and R,) the geometry already given on ®, (and §,). 
Thus the correspondence zg" = x™ + t* mentioned above must be isometric 
correspondences between thevarious R,. This entails that ga1g1 be independent 
of z*. If, further, coérdinates x“ in the R, and z* in 9, are so chosen that 
points which correspond to each other in an isometric correspondence have the 
same codrdinates numerically, it follows that gaig1 = ga. Similarly, we require 
Ja2s2 = gi;. These properties are invariant under change of code. 

If the components gaia: vanish in one code (and therefore g*'® = 0), then 
Jaia2 = O in any code. The special product R, with this important property is 
called the direct product of R, and R,. §§2-7 aredevoted to the direct product, 
which is by far the most interesting product.” 

If the components gaia: are unrestricted, except for the requirement that 
Jag be symmetric and positive definite, R, is called a general product of R, and R,. 


II. Tue Direct Propuct 


2. Product-tensors 


The components of a tensor fall into classes according to the superscripts 
(1 or 2) of their indices.’ The set of components selected from 7%: ::3 by assigning 
a superscript to each of its indices is called a member of 7. If each superscript 
is 1 (or 2), the member is called the first (or second) member; otherwise, the 
member is called mixed. 

Let the law of transformation for 7' be 

ee eee ee ee 
ee SSP az”! aa aah one 
If particular superscripts are assigned to the free indices a and B, the quantities 
on the left side of this equation belong to a member of 7’. If x <> #is a change of 
code, it is seen that each y and 6 may be assigned the same superscript as the 
corresponding a and 8, showing that this member of 7’ behaves, under a change of 


* The chief results already known are conveniently summarized by W. Mayer in his 
Lehrbuch der Differentialgeometrie, Bd. I1, pp. 147-152. Cf. Eisenhart, “Symmetric Tensors 
of the Second Order whose First Covariant Derivatives are Zero,” Trans. Amer. Math. 
Soc., vol. 25 (1923), pp. 297-306; and Levy, “Symmetric Tensors of the Second Order whose 
Covariant Derivatives Vanish,’’ Annals of Math., vol. 27 (1926), pp. 91-98. 

§ The operation of assigning a superscript to an index can be given explicit expression by 


1 
6% 0 
means of the tensors Ms = (3 0 and N;3 = (; e For example, To" = Ms T®. 
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code, as a tensor of the same type as 7’. This is true of each member of 7. In 
this sense, every tensor is compound. 

If a member of 7 vanishes in one code, it therefore vanishes in any code. If 
every mixed member of 7’ vanishes in a code, T' is said to be breakable. 

If a tensor 7 is breakable and the first and second members of 7' depend, in 
any code, only on variables of the first and second kinds respectively, then T 
will be called a product-tensor. gag and g* are product-tensors; also, g*™" = g” 
and ge” =g". The sum, product, and contracted product of product-tensors 
are product-tensors. 

The Cristoffel symbols 


[as, 7] = 3 (Se 4 Mar _ ue) 


— 





- ee Soe eo 


= “a = . . a —_ 
? ie 
Pe a ee ns 
3 ra ae 





$n ta ae 





ax ax8 ax? 
and 
i, - 9g” lap, y] 


also have the product character. That is: they vanish unless all indices are 
alike; if all indices are alike, they depend only on variables with indices of the Bh.) 
same kind. Also, t | pox 


SE Se a ae 


LD RT SPST I TE a oS 


















[a'B’, 1'] = [ab, cl, [a'6", 7’) = [1 ,kl, h | \ 4 
rig = TS, Posg: = Vij. | he : 





From these properties of gag and Tg , it follows that the tensors 





args O's, + Tele, — Taye My | 





R3ys 


Il 


azy—s- ax 






Rev = Jackesns 
Rgy = Reza 


are product-tensors, and that their first and second members coincide with the 
respective tensors on Rt, and t,. Also, 


R= g” Rap = g*” Ratgi tr g7 Rasp: = Rk, + kp. 







The covariant derivative of 7'g/.-. g,° is 
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If 7 is a product-tensor the right member of each of these equations vanishes, 
the remaining mixed members of 7,2 vanish because T and Ts, are breakable, 
and the first (second) member of 7’. depends only on variables of the first 
(second) kind. Hence 

THEOREM 2.1: Jf T is a product-tensor, so is Ta . 

Thus the covariant derivatives of the curvature tensor are product-tensors. 
Their first and second members coincide with the respective tensors on , 
and §, . 

If 7, is breakable, (2.2) shows that the first (second) member of 7 depends 
only on variables of the first (second) kind, so that 7’, has the same property, 
and is therefore a product-tensor. 

THEOREM 2.2: If T',. is breakable, it is a product-tensor. 

Since a vector V has no mixed components, the argument just given proves 
that, if V,. is breakable, V is a product-vector. Combining this with Theorem 
2.1, we have 

THEOREM 2.3: V is a product-vector if and only if V4 is a product-tensor. 

The product ¢* of the vectors é” of R, and n’ of R, is ¢* = (€", n®).  Simi- 
larly, the product of & and 1; is f¢ = (a1, Na2). If 2 = gast’ and i = gijn’, 
then, since gatz2 = 0, their product is connected with the product of & and 7' 
by the relations f2 = gast®, ¢* = gts. 

Any vector ¢* = (¢*', ¢*) of R, is the sum of ¢f = (¢*, 0) in the R, and 
cf = (0,¢") inthe R,. Since gatas = 0, ¢f and ¢¥ are orthogonal, so that ¢* has 
squared length | ¢ |? = | & |? + | &|?. 

If 6;, denotes the angle in R, between a pair of arbitrary unit vectors ¢* and 
n of R,, ¢ the angle between é and &, , ¢, the angle between 7 and m, 6; the 
angle between & and m , and @, the angle between £ and 7 , then 


COS Oty = gast*n’ = garpié” + Jarre” 
= |& || m| cos 0 + | & || m2 | cos 


COS gz COS d, COS 4; + sin ¢; Sin ¢, COS A. 


I 


3. General Theorems 


Before examining the detailed structure of product-spaces, we prove five 
theorems of a general nature. 

A Riemann space with metric tensor ga is flat (Euclidean) if there exists a 
coérdinate system in which ga» = 6. The product of twe flat spaces is ob- 
viously flat. 

THEOREM 3.1: A space of constant non-vanishing curvature cannot be a product- 
space. 

If R, were to have constant curvature K ~ 0, then Rasy = KTasys , where 
Tabs = Gar9ss — Ysa. But Taigey192 = gaiygers2: , Which cannot vanish for 
every choice of its indices, so that the non-breakable tensor 7 would be pro- 
portional to the breakable tensor R. This contradiction proves the theorem. 
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SE = 









i 

TuEoREM 3.2: A space R, in geodesic correspondence with a product-space R, ) 
is not necessarily a product-space. | io, 
A counter-example is furnished by a sphere, which is mapped geodesically by 1 . % 

| 

it 

i] 

| 

. 

{ 











central projection on a flat plane tangent to it. 
THEOREM 3.3: R, = Rp XK R, ts conformally flat and if and only if: 
a) when p = 1, g 2 1, R, ts of constant curvature; 
b) when p = 2,q 2 2, R, and R, have constant curvatures for which K, + Kz, = 0. 
If p = 1, ®, is flat. If g = 1 andr = 2, ®, and R, are also flat, and the 
theorem holds. 
Ifp=1,¢ =2,r = 3, R, is pay flat if and only if* 

















Rasy = Rap, “pg Rays + ae p) Gar ~ ees Jas R,) = 0. 









Since R; = Oand R = R, + Re, it follows from 2(r — 1)Raigi,z = —gaigiht,y: = 0 ne 
that R, is also constant, and the condition reduces to Rasy — Rays = 0. In®, | pene 





we choose orthogonal coérdinates, in which g” = 0, and have Ry = g"Ran = 0, i! 
whence Ra2g2,2 = 0 reduces to Rue = 0 = Rei. The first of these yields i 


C) (g” Reus) 
02? 







_ 





— 29” Rone lis = i — 29” Reuo(T 2 + Te) = 0. 


ta 





Cancelling g”, we have 









ARa12 fe Roe 0g i 
ax? -. = 
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which says that Reus/g is independent of 2’; similarly, this ratio is independent 
of x. Thus Roe = — Kg2gu , which is the condition in orthogonal coérdi- hi Gf 
nates that #, have constant curvature. Mi 
If r = 4, R, will be conformally flat if and only if the conformal curvature Hh. | 
tensor vanishes, that is, I 














1 R : i, 
sis a no ld ass = 0, ri 
(3 1) Capys Rasy r 2 7 apys (r 1)(r 2) Byé i 


where { T ars = JayRes — gas Ray + gas Ray — Gay has, 
Uapys = JasJpy — JarJss- 





Necessary conditions are found first. Caig2,132 = 0 gives 


(3.2) (r sie 3 1) (Gat+1 Reese + 9a2s2 Raiy)) + (Ri + Re) (—Gaiy' 93282) = 0. 





From this it follows, on ee by g°”, that > mm Fé 
— (p — 1)R, 
ze A) 






(3.3) Raiy = 





Jalyl, 







‘Cf R.G., p. 92. 
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and, on multiplication by g*”, that 
Re — (q-— UR 
(3.4) Rars2 = P "oye 1) p22. 


If p = 1, q = 3, then R, = 0, r — 1 = q, and (3.4) shows that 





Reese = = go . 


Thus %, is an Einstein space; since g > 2, Re is a constant. Substitution in 
Ca2g2282 = 0 gives 


R 
Ra2g24232 = ar (Ga2y2 9pr%2 — Ja2s? Jey?) 


and ®, has constant curvature. By substitution in (3.1) this necessary condi- 
tion is seen to be also sufficient. This completes the proof of a). 
If p = 2, and q = 2, either (3.3) or (3.4) shows that 


(3.5) p(p — 1)Re + g(q — 1)Ri = 0, 


and differentiation of (3.5) shows that R, and R, are constants. (3.5) applied 
to (3.3) and (3.4) show that 


(3.6) pRary = Ray , qRp252 = Regg2s2 ; 


that is, each factor space is an Einstein space. (3.5) and (3.6) are necessary 
and sufficient conditions that C.s,3 be breakable. 
When (3.6) is used to simplify the conditions C.1g1,:31 = 0, the result is 


1 2k R R 
(3.7) Rargiyis1 — a (= a : ns 2 





) U aigtyigi = 0. 


Substitution from (3.5) into (3.7) gives 


(3.8) Resginit = = a ay Usian, 
showing that ®, is a space of constant curvature K,; = R,/p(1 — p). Simi- 
larly, R, is a space of constant curvature Kz = R2/q(1 — q). (3.5) is the same 
as K; + K,=0. The conditions stated in [3.3] are thus shown to be necessary. 
Direct substitution in (3.1) shows them to be sufficient. q.e.d. 

It seems remarkable that this theorem lays no further restriction on the 
dimensions of the factor-spaces. 

THEOREM 3.4: A product-space is an Einstein space if and only if each factor- 
space 1s an Einstein space and 


S| 
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For R, to be an Einstein space it is necessary and sufficient that 


R R, + R; 
Ras = — Ja = it f, 


i ete 


Since Rag is a product-tensor, and Ri/p = R2/q implies that either equals 


(R: + Re)/(p + q), the sufficiency of the condition is evident. 
It is necessary that 


Contraction with g”" ' shows that R/p = R2/q, whence 


R, 
Rai ; = — geist 
B p B 





and %, is an Einstein space. Similarly for ®,. q.e.d. 

If R, can be imbedded in a Euclidean €,,,, but not in an ©,,, if n < p’, 
then #, is said to be of class p’. 

Let R, , R, , and R, have classes p’,q’, andr’. Letp=pt+p,g= q+’, 
define 7 = p + @, and, for the remainder of this section, let indices have the 
following ranges: 7 = 1,---,p3¢ = p+1,---,7#;i = 1,---,#a@ = 
l,-++ , pya@ =e p+ 1, oe P+ qa = 1, me »P, p+ 1, reer »pt+ 4q. 

If 
y* “its f' (a), 
y= se"), 
define imbeddings of %, in an G; and of R, in an G;, then y’ = f‘(x*) define 
an imbedding of R, in an G; » Hence r’ S p’ + q’. The possibility r’ < 
p’ + q’ is excluded by 
THEOREM 3.5: r’ = p’ + q’; that is, the class of a direct product-space is the 


sum of the classes of the factor-spaces. 
If r’ < p’ + q’, equations 


(3.8) 


(3.9) z’ = ¢' (x*) (@’ =1,---,rt+r’) 
exist which imbed R, in an &,,,". We regard &,.,, as a subspace 
(3.10) Z=c’ vV=r+r+1,---,f) 


of €;. There then exists an isometric point transformation of €; carrying the 
subspace defined by (3.8) into the subspace defined by (3.9) and (3.10). Every 
isometric point transformation of ©; preserves the quadratic form ds> = 
Di (dy’)’, and is therefore of the form 9° = aiy’ + a‘ with aj orthogonal. Let 


Oy* dy* : 
* It follows from the equations gas = = eA for the imbedding of any R, in an &; that 


an X, which has an imbedding of the form (3.8) is a direct product-space. 
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2 = aiy’ + a‘ bé such a transformation which carries (3.8) into (3.9-10). 
Then ajy’ + a’ = c’. Differentiation of this equation shows by (3.8) that 
ay" ay” 
(3.11) aj; —. = 0, ajz —“.. = 0. 
ox Ox 


Since || a; || has rank 7 — (r + r’) > O, either || aj: || or || aj2 || has positive 
rank, and it follows from (3.11) that there is then a functional relation either 
between the y* or between the y’, i.e., that either ®, is of class < p’, or R, is 
of class < q’. This contradiction proves the theorem. 


4. Geodesic Subspaces 


In this section we define products and projections of subspaces and study 
their geodesic properties. gar 
Let 2* = a*(u") (@® = 1,---,1 S p) anda’ = a‘(u") (*® = 1,--- ,m <q) 


define subspaces ft; and MR, of R, and R,. The equations a = z*'(u*’), 
a® = 2’ (u'’) define a subspace R, (n = 1 + m) of R, which is called the product 
of MR, and Ran ‘7 

Let 2* = x*(u*) (a’ = 1, --- ,n <1) be asubspace R, of R, , and let z™ = c™ 
define an R,. If || ax” /au*’ || has rank p when «® = c*, the equations 
c* = x (u*) can be solved for p of the u* in terms of the remaining n — p 
parameters, which we call w™ (a = 1,---,”2 — p). By substituting these 
p u’s in the equations z* = x2* (u*’) = 2 (w“), we find the intersection of R, 
with R, ; if, after the substitution, || az*'/aw™ || has rank l, the intersection is of 
dimension 1. The intersection of R, with R, is defined similarly. 

The projection of R, on R, is more useful for our purposes. It is the sub- 


space of R, defined by the equations 
eg” 7 a*' (u*’) 


2 ran Po 


The dimension of the projection equals the rank of || az*'/au*° || and does not 
exceed the lesser of p and n. The projection of R, on R, is defined similarly. 
The projection of &, on R, contains the intersection of R, with R,. If R, 
is a product-subspace, its intersections with the factor-spaces coincide with its 
projections on the factor-spaces, and R,, is their product. 

If P(t) is an are defined-by x* = x*(t) (0 S t¢ S 1), its projections are P(t): 
x* = 2°(t) and P,(t): 2° = 2'(t). The length of P(t) is defined formally by 


_ f'ds -f ds\\’ , (ds:\’} 
sae [[(G) +) le 
a dt (v = 1, 2) of the projections 


dt 
P,(t) both exist, for s, S s. If both s, exist, then 





1 
If s exists, then evidently the lengths s, = [ 
0 


1 1 2 2 4 
oY ds dso and [ ds dss ds, dso 3 
8 + & = [ (@ + ae) at = ‘ | (#) + (“) +2 dt | dt; 
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ds, ds. 


a a = 2 0, and s; + s: = s, so that s also 


since Jas and g;; are positive definite, — 


exists. This shows that 

THEOREM 4.1: An arc in R, is rectifiable if and only if each of its projections is 
rectifiable. 

Now let P(t) be a geodesic of R,, and for any u let i = du/dt. The x*(t) 
satisfy the equations 


TY = i*(#? + rs 3472’) — #°(@* + 15,472") = 0. 


Of these, the equations T** = 0 and T**” = 0 show that the projections 
P,(t) in R, and P2(t) in R, are geodesics. 

Suppose conversely that Pi(s:) and P2(s2) are geodesics of arc-lengths s; and 8» 
given by x* = 2x°(s:) and x = 2'(s:) in R, and R,. We seek a function 
& = f(s:) such that x*(s:) = (a*(s:), z'(f(s:))) shall move along a geodesic P(s;) 
lying in R, on the product P(s:, 82) = (P1(s:), Pe(se)) of Pi(si) and Pe(se). 
Since s, may not be the arc-length of x%(s;), we use the condition T” = 0. 
T° = O is satisfied by hypothesis. T°” = 0 gives 


dc™ (4 ao 4 r,s dx” =, (2) 4 dx™ dx ds 
ds} ty dse ds2/ \ds; 


ds, ds. dsj 


_ da” (dx - at da” = = 
d8_ A dst + Ti, 1 ds; ds, — 0. 





ds, 


Since s; and Ss are arc-lengths by hypothesis, the terms in parentheses vanish. 
d 81 ds. So 
and s. = as; + b. Direct inspection shows that this condition is sufficient for 
T“* — 0 also to be satisfied by P(s,). This completes the proof of 

THEOREM 4.2: A curve is a geodesic if and only tf each of its projections is a 
geodesic. A curve on the product of two geodesics, with arc-lengths 8, and 82, 18 a 
geodesic if and only if it has an equation of the form sz = as, + b. 

We shall say that a subspace of a Riemann space Rt, is plane (or geodesic) 
at a point P if it contains entirely each geodesic of #, which is tangent to it 
at P. A subspace is a plane (or is totally geodesic) if it is plane (or geodesic) 
at each of its points.° 

It is a corollary of [4.2] that each ®, and each ®, is a plane (is totally geodesic) 
in R,. We examine other subspaces of R, for this property. 

Normal codérdinates are useful. Let P = (Pi, P2), let y* and y’ be normal 
coérdinates at P, in ®, and at P, in R,, and consider the code which is the 
product of these codrdinate systems. A geodesic C of R, issuing from P has 
projections which are geodesics ([4.2]), and which therefore have equations 


d? 8» 
Since cannot vanish for all choices of a’ and 6, it follows that ds? = 0, 


‘ These definitions are from Cartan, Lecons sur la géométrie des espaces de Riemann, 
p. 131. 
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y° = et and y' = £'t. C then has equations y* = é*t and &* is tangent to C 
at P. This proves 

THEOREM 4.3: The product of two normal coérdinate systems is a normal code. 

Now let ®, be any Riemann space with normal coérdinates y* at a point P, , 
and let & be any p independent vectors at P:. The equations y* = 2s 
(b’ = 1,--- ,l S p) define a subspace ®, which contains entirely any geodesic 
tangent to it at P;—in other words, ®; is plane at P;. If, coriversely, 9, is 
plane at P;, we use normal coérdinates at P; , choose / vectors tangent to ®, 
at P,, and write its equations in the form y* = 2’ths. Thus a subspace 9, 
is plane at P; if and only if, when y* are normal coérdinates at P; , and £i: 
(a*® = 1, . .. , 1) are vectors tangent to 9%, at P:, it has parametric equations 
yo = 2° hs. 

At a point P = (Pi, P2) in R, we now use the normal code y* which is the 
product of normal coérdinate systems y* at P; and y' at P2. If a subspace 
R,, is plane at P, it has parametric equations y“* = 2"t2s (av = 1,---,n), 
where £%s are the vectors tangent to it at P. Its first projection ®; is y* = 
z*¢23. The Jacobian || ay*/az*° || = || &s || has rank 1; suppose that || £5 || 
(a® = 1, ---, 1) has rank J, and let at = 1+ 1,---,n. Then gs = ctit’s and 
y? = (2 + 2% cts)es. Since the és are tangent to 9%, this shows that ®, 
is plane at P;. Similarly, the other projection ®,, is plane at P2. 

Conversely, let #; and R» be plane at P; and P,. By [4.2], a geodesic C 
issuing from P tangent to R, has projections which are geodesics issuing from 
P, and P, tangent to R; and ®,,. Since these projections lie entirely in 
and ®, , C lies entirely in RF, , so that R, is plane at P. This completes the 
proof of 

THEOREM 4.4: A subspace is plane at P = (P;, Pe) if and only if its projections 
are plane at P, and P2. | 

Since a product-subspace is the product of its projections, we see that 

Coro.uary 4.41: A product-subspace is plane at P = (P1, Ps) of and only if 
its factors are plane at P; and P2. 

Combined with the definition of a plane, [4.4] gives immediately 

THEOREM 4.5: A subspace is a plane if and only if each of its projections is a 
plane. 

Corouiary 4.51: A product-subspace is a plane if and only if its factors are 
planes. 


5. Parallel Displacement 

£* at 2“ and &* + dé* at x* + de® are parallel if and only if d&* = —#T 3, dz’. 
On any one of these equations, all the indices are of the same kind. To con- 
struct a vector &* + dé“ at x“ + dr“ parallel in R, to £* at x*, we therefore 
construct vectors ¢* + dé*” (v = 1, 2) at x” + dz®’ parallel in the v™ factor- 
space tog’ at z*”. Then &* + dé" = (¢* + dt*, &* + dé”) at x* + dz“ in R, 
is the required vector. 

Parallelism along an arc has equally simple properties. Let Ro = (Po, Qo), 
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R; = (P1, Q1), and let C(Ro , R:) be an are given by x* = x°(t) (0 < S 1). 
All the indices on any one of the equations {* = —¢° I'3,<” must be of Hh same 
kind, and the equations fall into two entirely distinct sets. The displacement 
of ¢* by parallelism from Ro to Ri, may thus be effected by displacing the 
projection —* by parallelism along x" = x(t) from (Po, Qo) to (Pi, Qo) in 
the R, whose points are (P, Qo), &* " being constant, and then displacing the 
projection &* * by parallelism along x" = 2” (t) from (Pi, Qo) to (Pi, Q:) in 
the R, whose points are (Pi, Q), ag being constant. Conversely, any dis- 
placement according to this law is a parallel displacement in R,. Thus we have 

THzOREM 5.1: A vector is parallel for an infinitesimal displacement, or along 
an arc, if and only if its projections are parallel for the projections of the infini- 
tesimal displacement or along the projections of the arc. 

From this fact it is easy to show that 

THEOREM 5.2: The holonomic group of R, is the direct product of the holonomic 
groups of R, and KR, . 

The holonomic group’ at a point P (of any Riemann space) is defined as follows: 
Let C be any oriented closed curve beginning and ending at P. If a vector &* 
is carried by parallelism from P once around C and returns to a position &*, 
then £* = as(C)é°. These transformations form a group H(P) which is called 
the holonomic group at P. H(P) and H(Q) are conjugate under the group of 
non-singular linear transformations, which justifies reference to H(P) for any P 
as “the” holonomic group of the space. 

Now let C be an oriented closed curve beginning and ending at P = (P, ,P2) 
in our product space R, , and let C; and C, be its projections. [5.1] shows that 
i = ag(C)é°, where af. (C) = a$-(C,) (v = 1, 2), and as = 0 = am. This 
proves [5.2]. 

We return to infinitesimal parallelism and study the curvature of R,. If dx* 
and 6x° are differentials at x“, and £* is carried by parallelism from x* around the 
parallelogram (dx*, 5x%), it suffers an increment VE" = &R,sdzx" 62° = 
(ve", ve"). ve = & RE, ror dx”’ 52°” (v = 1, 2) is the result of displacing 
E* around a Sig ng rae? (dx*’, x*”) in the v™ factor-space. 

Letting §& , resolving vee along dx“, and normalizing the resulting 
scalar, we bide? ae siPiritans of R, for the two-spread (dx“, 52°): 


Raps dx x° dx bx° ns 
(Gay 98s — JasJey)dx* 6x8 dx? dx° 


(5.1) K = 





In studying this formula the bivector® D” = dzx* éa° — da’ ez" is useful. 
Its squared modulus is | D |? = D®Das = (Jax9ss — JasJsy) dx* bx° dx’ bx°. If 
this expression in (5.1) differs from one, we choose a pair of orthogonal unit 
vectors dx“ and 6x* in the two-spread (d, 6) and, with the new vectors, we have 


"Cf. Veblen and Whitehead, The Foundations of Differential Geometry (Cambridge Tract 
No. 29), p. 92. 
* On bivectors, see Cartan, loc. cit., Chap. I, Sec. II. 
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| D |? = D“D.g = 1. The projections of this bivector on the factor-spaces are 
D*" and D*’’. The angle ¢ between the bivectors U“ and V“ is given by 
|U |\|V| cos @ = U%Vag. Thus | D, |’ = D*'*"Daigi = cos’ $1, where ¢; is 
the angle between D® and D**'; similarly for | De |’. 

With these formulas, (5.1) may be written as 


K = Rasys dx* ba° dx” bz", 

(5.2)  K = Ratgtygi dx™ 6x" dx” bx" + Rezgry252 dx bx dx” bx” 
K = K:DeD*" + KeDa2pD*™, 

(5.3) K = K, cos’ ¢; + K2 cos’ de. 


Here K, and Kz are the curvatures of R, and R, for the two-spreads associated 
with D*” and D*”®’. (5.3) is analogous to Euler’s formula for normal curvature 
in classical differential geometry. 

There are special cases of (5.3) corresponding to the possible intersections of 
the two-spread (dx*, 6x“) with the factor-spaces. These intersections are deter- 
mined by the ranks s; and s2 of the matrices S; = | nie | and S, = - : 

| 


| | | 

| da | 

“ | has rank 2 by hypothe- 
1 

sis, 8; + 8. = 2. (dx*, dx*) will have the point P; (or P2), a one-spread, or 
itself in common with R, (or R,) according as se (or s:) = 2, 1, or 0. Now 


suppose, for example that s,; = 2 and s. = 1. Then Se may be supposed to 











s, < 2means D*””’ = 0 (vy = 1,2). Since S = 








2 . 

| a | 

have the form | _ | 
pD** = 0, Ke is undefined and (5.3) has no meaning). By considering simi- 


larly other values of s; and s2 , we verify the facts contained in 


and it follows from (5.2) that K = K;, cos’ ¢; (since 


| dx™’ || 
THEOREM 5.3: The ranks s,; and s of the matrices S; = ‘ 
| 


a? || 
wih if the intersections of the two-spread (dx*, 6x*) with the factor-spaces, 
| 
the curvature K of R, for (dx", da°), the curvatures K, and Kz of R, and R, for 
(dx* , 6x") and (dx* , dx“), and the angles ¢, and $2 in R, between these two- 
spreads and (dx*, 5x) are related according to the following table: 


x7 | 





S, = 



































| Intersection of (dz, 5x) 

a1 | & with Rp with R, K 

0 2 P, Lies in R, Ke 

2 0 Lies in R, P, Ky 

1 1 A one-spread A one-spread 0 

1 2 P, A one-spread Kz cos’ $2 

2 } 1 A one-spread P, K, cos’ $1 

2 | 2 P, P, K, cos’ $1 + Kz cos’ $2 
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The case s; = 1 = 8 gives 

CoroLLARY 5.31: A geodesic two-spread which cuts both factor-spaces is flat. 

Proor: K is defined as the Gaussian curvature of the geodesic two-spread 
tangent to the two-spread (dx“, 6x“). [4.51] shows that such a two-spread is 
totally geodesic, and [5.3] shows that K = 0 at any point at which it is geodesic, 
ie., at each of its points. K = 0 means that two-spread is flat. 
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6. Parallel Fields of Vector-spaces 


The set of vectors at a point P linearly dependent on p, linearly independent 
vectors at P is called a vector-space V,,. If a V,, is given at each point, we 
speak of a field of vector-spaces; the symbol V,, may usually be used without 
confusion to denote a field of vector-spaces or the space belonging to the field 
at a given point. 

It may happen that any vector of V,, at any point P, displaced by paral- 
lelism from P along any arc, remains a vector of V,, ; in this case V,, will be 
called a parallel field of vector-spaces. In our product-space R,, the field of 
vector-spaces tangent to the R, is evidently a parallel field; similarly for the R, . 

Let V,, be a field of vector-spaces of R, spanned by 7 (o = 1, --- , 71). The 
projection of V,, on the subspace R, given by z* = c” is a field of vector- al 
spaces V,,(a™), where p, is the rank of the matrix || & ‘(c*’, c*) ||; similarly 
for V(t”) on the R,. Suppose now that V,, is a parallel field of vector- | 
spaces. In the R, given by z® = c™, let a vector * of V » (a) be displaced Hes 
by parallelism in R, from P to P’ along an arc lying entirely in R,. By [5.1], } |B 
“ moves also by parallelism in R, , and &* moves by parallelism in R,. By HY | Paeoa 
hypothesis, ¢“ stays in V,, , so that é*’ stays in V,», , Which is thus shown to be a 
parallel field of vector-spaces in R, ; similarly for V,,. If, conversely, each He 
V,, and each V4, is a parallel field of vector-spaces, then V,, is a parallel field . 
of vector-spaces. In other words, 

THEOREM 6.1: A field of vector-spaces in R, is parallel if and only if each of | 
its projections is parallel. i ia 

If V,, is the product (direct sum, or join) of V,, and V4, , it follows that i 

CoroLLary 6.11: The product of two fields of vector-spaces is a parallel field I 
if and only if each factor is parallel. | ih 

An absolutely parallel vector-field is equivalent to a parallel Vi. If &* is 
absolutely parallel in R,, then 

ag ag 


1 a2 
te: = a 0, fa. = ro Shes 0, 
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and “ is a product-vector. This proves 

CoroLLary 6.12: A vector of R, is parallel if and only if it is the product of 
two parallel vectors. 

Corotuary 6.13: If Rp and KR, have p, and q independent parallel vectors, 
then R, has ry = p, + q, independent parallel vectors. 

The chief known result on product-spaces is 

TuEoreM 6.2: If a Riemann space R, has a parallel field of vector-spaces V >», ; 
and p, < p, then ®, is the direct product of an Rp, and an Ryp-p, « 
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This theorem and its proof are to be found in Mayer, loc. cit. The spaces 
V,, are tangent to the subspaces R,, , and the spaces V,_», of vectors normal 
to Vy, are tangent to the subspaces R,_,, and also form a parallel field. 

Let 9, have p; independent parallel vectors. They span a parallel field V,, 
of vector-spaces. We apply [6.2] to 8, and then again to ®,, , in which each 
parallel field is equivalent to a parallel vector-space V; , and have 

CoROLLARY 6.21: An ®, with exactly p, independent parallel vectors is the 
product of a flat €,, and an Rp», with no parallel vectors. 

This corollary furnishes a useful normal form for a product-space with parallel 
vectors. Let ®, and ®, have p, and q parallel vectors. Then ®, = 
Ep, X Rev, , Ra = Ey XK Rog, , and R, = (€,, X €q,) K (Ryo: KX Rea) = 
GX Rao,- Ry», , Reg: , and R,-,, , which have no parallel vectors, will 
be called the irreducible components of R, , R,, and R,. The dimension of an 
irreducible component is not less than two. 

A parallel field of vector-spaces is called simple if it contains no parallel field 
of proper subspaces. In a space R,, let V, be a simple parallel field of vector- 
spaces. By [6.2], R. = Rp X RM, (¢ =r — p). Mp, cannot be factored into a 
product of two of its subspaces, for V, would then not be simple; we call ®, 
simple. To prove [6.3] we need 

Lemma 6.31: If a Riemann space R, has a simple parallel field of vector-spaces 
V,, any parallel field of vector-spaces either contains V, or lies in the field V, 
(q = r — p) of spaces of vectors normal to V, . 

Let &° (9 = 1, ---,8 <r) span a parallel field of vector-spaces W, which does 
not contain V,. W, cannot intersect V,, for the intersection would be a 
parallel field of proper subspaces of V, , contradicting the assumption that V, 
is simple. Hence the equations s°> * = 0 can have no solutions s° except 0, 
and therefore s < q, and || £*’ || has rank s. Either W, lies in V,, or, since 
V, is simple, the projection of W, on V, must be of dimension p, so that s 2 p. 
If q < p, this contradiction proves the lemma. If q = p, then ||£*" || has rank p, 
and there is a p for which &, # 0. 

A parallel field of spaces Z, will be constructed which intersects W, in a 
parallel field of spaces whose projection on V,, by virtue of [6.1], will be a 
parallel field of subspaces V,_, ; this contradiction will prove the lemma. Let ¢ 
be a constant differing from zero and from one, let ¢} = cf, and ¢* = & (a > 1), 
and let Z, denote the field of spaces spanned by ¢*. It is shown by Mayer 
(loc. cit.) that the field of spaces spanned by £* is parallel if and only if func- 
tions is exist for which 7, = yi. Then (1, = céhg = cytete = voete, 
and, when a > 1, f)6 = fn8 = Yoste = Yeates , OF, for all a, S58 = Yoase 5 Ze is 
thus a parallel field of spaces. Since || ¢7° || = || €@” || has rank s, and s < 4, 
the system (’¢* = ¢@ = s’*" | for given s’, has unique solutions ¢? = s°, and, 
if  ~ 0, then é’ ¥ t’¢} = cs’t} = ct', so that £* is not in Z, ; similarly, W, does 
not contain Z,. W, and Z, intersect in the parallel field of subspaces given by 
t' = 0 = ¢', whose projection on V, is the parallel field of proper subspaces 
Vp given by ¢' = 0. This contradiction proves the lemma. 
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It is now possible to prove 

THEOREM 6.3: A Riemann space R, is a direct product of simple factor-spaces. 
Two decompositions of R, into simple factor-spaces differ at most in the order of 
the factors. 

Let R, be a Riemann space. Its decomposition into flat and irreducible 
components (cf. Corollary 6.21 ff.) is evidently unique, so no generality is lost 
in assuming that R, is irreducible. 

Either R, is simple, or the field V, of its tangent vector-spaces has a simple 
parallel field Vp, of proper vector subspaces. Then R, = ®,, X My, ( = 
r— pi), and ®>, is simple. Either §,, is simple, or R,, = Rp, X Ry, (Ge = 
(1 — p2), Where ®,, is simple, and so on. At each step, the dimension of the 
unfactored component §R, is lowered by at least two, and we arrive finally at a 
decomposition of R, into a direct product of simple factor-spaces: R, = 
Ny, X--: X Rp, , with p, + ++ +m Sr. 

Let R, = Ri, X-++& Ri, , with 8: + --- + 8; = 7, be a second decom- 
position into simple factors. The simple parallel field V‘, of spaces tangent 
to R;, contains the simple parallel field V,, of spaces tangent to R,, , or, by 
the lemma, it lies in V,,. If V:, contains V,,, they coincide, for otherwise 
V:, would have the parallel field of proper subspaces V,, , and would there- 
fore not be simple; in this case Ri, = Rp,- If Vi, lies in V,,, then Vi, either 
contains V,, (and, as before, coincides with it), or it lies in V,,. Proceeding 
in this way, we either identify 9;, with one of the ®,, or find that V:, lies 
in V,,. If Vi, and V,, did not then coincide, V,, would not be simple. In 
any case, V;, coincides with one of the V, , and i, coincides with one of the ®, , 
which we interchange with ,, , and write the second decomposition as R, = 
RN, X Ri, X ++» & Ri,. The space Vj, of vectors tangent to Rj, = 
Ki, X ++» X Ri, consists of the vectors normal to V,, and therefore coincides 
with V,,. Also, pe +--- + pe = Gi = Se +++: + 8. We now treat the 
spaces Io, = 9;, similarly, identifying 9:, with a factor of R,, , rearranging 
the factors R, , if necessary, so that 9;, coincides with ®,, , and finally have 
R., = Ri, with pp + --- +m = G@ = 8 +--- +8. If k were to exceed l, 
a stage would be reached, after 1 — 1 steps, when 2), = Re-, = Re-. = 
Rp, X ++» X Rp,. But then the simple parallel field of spaces Vi, would 
contain the parallel fields of proper subspaces Vy, , --- , Vp, , contradicting the 
assumption that Rt, is simple. Hence k <1. Similarly,! < k. Hence k = I, 
the process terminates automatically, and ®,, = R>,. The proof of [6.3] is 
now complete, for the first factorization has been so rearranged as to be identical 
with the second. 

In terms of vector-spaces, [6.3] may be stated as 

CoroLuary 6.31: Every parallel field of vector-spaces is a product of simple 
parallel fields of vector-spaces. ‘T'wo decompositions differ at most in the order of 
the factors. 

We call two vector-spaces normal if each vector of either is normal to every 
vector of the other. The lemma may be strengthened as . 
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Coro.uary 6.32: If two parallel fields of vector-spaces do not intersect, they 
are normal. 


7. Motions 


It is seen immediately from Killing’s equations fs + &§. = 0 that the 
product of two motions is a motion, and that, if a product-vector is a motion, 
then each factor is a motion in its factor-space. If R,, R,, and R, have groups 
of motions G,,, Gr, , and G,, , it follows that 7; 2 m + m. 

This lower bound can be refined by using the normal form mentioned in 
connection with [6.21]. If R,_», has a G,;, then m 2 wi + 43pi(pi + 1); simi- 
larly for Q,-¢,- Hence ®,--, has a Gj, with m3; 2 7; + mm, and R, = 
G,., X Ri, has G,, , with 


nant 
Sit + Me tD 
= rit + TY 4 MOTD tora. 


In this sum the term pq, counts up those rotations of €,, which are not 
merely products of rotations of ©, and G,, . 

It will be shown that the equations hold in each case, i.e., that 

THEOREM 7.1: 73 = m + m2 + Pi. 

If &* is a vector-field, the infinitesimal transformation #* = x* + £*6t will 
be a motion if and only if 


(7.1) fa, + fa = 0. 

For any vector-field, 

(7.2) baty — tars = teRapy- 

From (7.1), (7.2), and the cyclic identity for R, it follows that 
(7.3) katy = SR ya - 


For &* to be a motion, £.,3 must satisfy this differential equation.’ 

(7.3) shows that &,s, is breakable. By [2.2], it is a product-tensor. Hence 
the first (second) members of £4, depend only on variables of the first (second) 
kind, but it is not necessarily breakable. If £,s is also breakable, & is a product- 
vector and therefore the product of two motions. 

If é.,¢ is not breakable, its mixed members satisfy equations (7.3), of which 


(7.4) et + &y1,02 Tips = 0 
x 





* Cf. C.G. §53. 
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is an example. Now, for a fixed value of a’, and for a change of code, the 
quantities 41,22 are components of a covariant vector on each R,. (7.4) shows | Hi te 
that this vector is a parallel field in each R,. Similarly, the vector £2.41 | 
(a' fixed) is simultaneously a parallel field in each R,. Further, if either of 
these fields is a zero field, it follows from (7.1) that the other also vanishes 
and é is the product of two motions. If either Rt, or MR, fails to have parallel 
vectors, it then follows that there are no motions of R, except product-motions. 
In this case G,, is the direct produtt of G,, and G,, , and 7; = m + m. 

Now let R,, R, , and R, have parallel vectors and be given in normal form. 
Since Rp-p, has no parallel vectors, Gz, = Gr; XK Gip.p, 41, and m = m + 
bo(pr. + 1); similarly for Ry - Similarly, R, has a G,,, with ™ = + 
dn(ri + 1), and ao = wT + T: » SO that T3 = ani(r; + 1) a wT! + wT = 7) + 
m+ pig. qg.e.d. 

The argument also proves that 

THEOREM 7.2: G,, 1s the direct product of a Gir(r41 and aG,;. The group 
G,; of the irreducible component R,-,, = Rp—p, XK Req, 18 the direct product of 
G,, and G,;. 

It is thus a simple matter to write down the symbols and constants of compo- 
sition of G, when those of G,; and G,; are known. 

A motion is a translation if the trajectory of each point is a geodesic. Since I | 4 
the translations of a flat space offer no difficulty, we deal with the irreducible i thes 
component ¥,-,, = Rp—p, X Rye, of our product-space. | Haeic 

If ¢* is a translation of Rp», along geodesics C;, and £° is a translation of f 
Nog, along geodesics C2, and &f = (€* , 0) and é = (0, £*’), then &* = réf + ut I 
isa motion of 9,_,, whose trajectories C have projections which are the geodesics ih 
C, and C;. It follows from [4.2] that the trajectories C are geodesics and §* ii | 
is a translation of R,_,, Conversely, if ¢* is a translation, it is the product of 1 
motions £* of Rp», and £' of R,-,, whose trajectories C; and C; are the projections i 
of the trajectories C of £*, which are geodesics. By [4.2], Ci and C2 are geodesics, i 
and é* and €' are translations. This proves 

THEOREM 7.3: A motion of R,-,, is a translation if and only if it is the product 
of a translation of Rp», and a translation of Raq, - 

An immediate consequence is He 

Coro.uary 7.31: R,_,, has a G,, of translations if and only tf Ry—p, has aG,, 
of translations, and Ry, has a G,, of translations, and G,, is the direct product of 
G,, and G,, . 
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III. Tae Direct AFFINE PRopvuct 


8. The Direct Affine Product 


We return to our product-manifold {R} = {P} x {Q} (§1), and suppose now 
that {P} and {Q} are affinely connected manifolds %, and Y%, . 













Cf. C.G. §48. 
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According to the superscripts of their indices, the components Aj, of an 
affine connection in {R} fall into classes which will be called members. An 
inspection of the law of transformation 
ax gg Ox an" | 2? 
age — 9" aap aay | anton 
of Aj, for a change of code shows (as was seen for tensors in §2) that each member 
behaves as an affine connection; we say that As, is compound. For a mixed 
member, in addition, the second derivative in (8.1) vanishes for a change of code, 
so that each mixed member is a tensor under a change of code. A connection 
will be called a product-connection if it is breakable in a code and if its first and 
second members depend, respectively, in any code, only on variables of the first 
and second kind. 

Let %, and %, have connections Af, and Aj,. In {R} we define Afi,: = A¥. , 
Agey2 = Aix, and As, = 0 if two of its indices are of different kinds. We call 
{R} with the product-connection Aj, the direct affine product of A, and %, and 
denote it by A,. Many of the theorems for the direct metric product depend 
only on the product-character of the affine connection, and can be extended 
easily to the direct affine product. 

The paths x°(t) of A, are solutions of the equations 2°(#° + I%,@72°) — 
af (@* + %5¢7%°) = 0, where 1%, = 3(A5s + Aj,) is the symmetric part of Aj,. 
The proof of [4.2] may be repeated exactly, with “geodesic” replaced by “path,”’ 
and “‘arc-lengths” by “affine parameters.” 

The definitions of “plane at a point” and “plane” are valid if “geodesic” is 
replaced by “‘path.’”” Evidently the subspaces A, and A, are planes in A,. 

An affine normal coérdinate system y at a point P is one in which the path 
through P in the direction ¢* has equations y* = £*t. With this understanding, 
[4.3] holds for affine normal coérdinate systems. The proofs of [4.4] — [4.51] 
can then be repeated exactly with “geodesic’’ replaced by “path.” 

Parallel displacement with respect to Ag, between neighboring points or along 
a curve has the product-character expressed in [5.1], and [5.2] therefore holds for 
A,. These theorems hold also for displacements parallel with respect to Ty. 
The study of curvature leading to [5.3] fails for want of a metric tensor. 

Theorem 6.1 and its corollaries carry over without modification. Mayer’s 
proof of [6.2] for an , depended on the fact that the family of spaces Vp», of 
vectors normal to a V,, is a parallel field of vector-spaces if Vp, is. In an Y, 
there is no such unique vector-space complementary to a V,, , and this method 
of proof of [6.2] is not valid. No direct proofs of extensions of the remaining 
results of §6 have been found. 

The infinitesimal transformation #* = x2“ + £6t will be a collineation of A, 
(will carry paths into paths) if and only if” 


Ey _ Bsa ’ 


Op; + 5, Es + D5 Ey io QB y Es = 0, 


(8.1) AS, 


(8.2) 





"Cf. C.G. §58. 
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where 23, = 3(Ag, — Ag), the semi-colon indicates covariant differentiation 
with respect to T'3,, and Bg,, is defined in terms of I's, precisely as R$,; was in 
terms of the Christoffel symbols ['s,. By virtue of the product-character of 
As, (and hence of T's, , 23,,and B;,s), it follows from (8.2) that the product of a 
collineation of %{, and a collineation of Y%, is a collineation of A,. 

We prove a weakened extension of Theorem 7.2. 

TurorEM 8.1: If either A, or A, fails to have a parallel vector-field, the group of 
collineations of A, ts the direct product of the groups of collineations of A, and A, . 

If £* is a collineation of A, which is not a product of collineations of %, and Y, , 
it follows from the first of (8.2), as in the proof of [7.2], that &3: is a parallel field 
in A, (8° fixed) and in A, (a’ fixed); similarly for ¢3i ._ If, then, either %, or A, 
fails to have a parallel field, each of these fields vanishes. This means that 
tn = a&"/ac” = 0 and &$i = at /ax™ = 0, so that £* is a product of two 
collineations, and the group of A,is the direct product of the groups of %, and 
%,. q.e.d. 

The more exact results of [7.1] and [7.2] cannot be extended without more 
definite information on an Y%, with parallel fields. 

A collineation may be called a translation if the trajectory of each point is a 
path. [7.3] and [7.31] depend only on [4.2]. Using the extension of [4.2] to an 
A, and replacing “motion” by “collineation”, the proofs of [7.3] and [7.31] may 
be repeated exactly for the case when one of the factor-spaces has no parallel 
fields. 


IV. RemMArRKs ON GENERAL PRODUCT-SPACES 


Rk, = Ry X R, is a general Riemann product-space if gaia: ~ 0. A two-space 
which is a general product will have ds’ = du’ + dv” + 2F(u, v) dudv with F’ < 1. 
It is trivially easy to show that, if R, is in one-one isometric or conformal cor- 
respondence with a product-space R, , then R, is a product-space which is direct 
if and only if R, is. 


9. General Product-spaces 


The peculiar simplicity of the direct product followed from the product- 
character of gag, Igy, etc., and the simple behaviour of product-tensors. In 
attempting to apply the methods used in studying the direct product, it is 
natural to try to restrict gale: SO as partially to preserve the simplicity of the 
direct product. Each such effort has been found to lead back to the direct 
product. Since these results are mainly negative, they will be summarized 
without proof. 

The complication of gag makes itself felt at a very elementary level. In an 
ordinary Riemann space §, , it is usual to speak of é* and associated functions 
t = gat’ as the contravariant and covariant components of the same vector &. 
If and » are vectors in R, and ®,, we define their contravariant product {7 
to have components ¢* = (¢“, 7”), and their covariant product {v4 to have com- 
ponents {ye = (1, Na2). It is easy to show that fra = Japs , that is, the covari- 
ant and contravariant products of two factor-vectors coincide in R, , if and only 
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if R, is the direct product of R, and R,. Further, the contravariant (cova- 
riant) product of a given pair of vectors will necessarily be also the covariant 
(contravariant) product of some pair of vectors if and only if R, is direct. 

The length of the elementary vector ¢; = (¢* , 0) calculated in R, obviously 
agrees with its length calculated in R,. It can be shown that fi = (&1 , 0) 
has the same length in R, as in R, if and only if R, is direct. The contravariant 
product of two unit vectors has squared length | ¢; |” = 2[1 + cos (, n)]. Their 
covariant product {, will obey a similar law if and only if the lengths of ele- 
mentary covariant vectors are preserved, i.e., 2, is direct. 

The difficulties encountered with covariant product-vectors may be removed 
in exchange for similar difficulties with contravariant product-vectors in the 
following rather artificial way. Instead of using ga» and 93: and defining further 
functions gataz, we use g” and g”, denote them by h*” and h®® and define 
further functions h”” # 0 in mache a way that h“ is symmetric and positive 
definite. In this way the manifold {R} is again made into a Riemann space 
which may be called a general contravariant product R’ of R, and R,. The 
remarks above on R, may then be paraphrased in an obvious way for R’. Fur- 
ther, we may define an (R,)’ associated with R, by taking h*™ to be the functions 
g* ~ occurring in the inverse g“ of gas in R, ; similarly, a given R’ has associated 
with it an (R’), constructed by replacing hag: and ha2g2 in the inverse has of h*” 
for R’ by gaigi = gab ANd Ja2g2 = gij;. It is easily shown that these definitions are 
legitimate in the sense that if the metric of R, (or of R’) is positive definite, then 
so is the metric of (R,)’ (or of (R’),). It can be proved that h® for (R,)" equals 
g” for R, if and only if R, is direct; similarly for R’ and (R’),. Finally, ((R,)’), 
= R, if and only if R, is direct. 

In the direct product, it was advantageous to know that the product of two 
motions or of two parallel vectors was a motion or was parallel. When the 
necessary conditions for the truth of these statements in a general R, are required 
to be satisfied by arbitrary product-vectors, it is seen that the contravariant or 
(independently) the covariant product of two motions or of two parallel vectors 
is necessarily a motion or a parallel vector if and only if R, is direct. This 
completes our remarks on vectors. 

In a general product-space, under a change of code, tensors are compound. 
I's, is compound, and its mixed members are tensors. The first (second) mem- 
bers of [8y, a] depend only on variables of the first (second) kind, but I's, does 
not necessarily have this property. The subspaces R, and R, need not be totally 
geodesic in R,. The projections of a geodesic are not necessarily geodesics 
of the factor-spaces, so that the product of two normal coérdinate systems is 
not necessarily a normal code. Our earlier results are thus incapable of im- 
mediate extension. 

By examining in detail the equations for parallelism along a curve, separate 
interpretations can be read off for each of the various members of T3,. By 
using §6, it can be shown that if gaia: are so defined in R, that T's, is breakable and 
Igy. = 1%, and I'gey2 = jz, then R, is direct. 
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In spite of the awkwardness of general product-spaces, sufficient conditions 
can be given that an arbitrary Riemann space R, be a general product-space. 
Suppose that a codrdinate system exists in which the coérdinates can be so 
grouped that, when transformations of each group separately are alone per- 
mitted, P'giy2 = 0; then R, is a general product-space and this codrdinate-system 
isacode. Finally, if R, is a Riemann space with a transitive group of motions 
(, which is the direct product of a G,, and a G,,, then R, is a general product 
of an Qty, and an ®,, , 7: and r2 being the ranks of the matrices of the symbols of 


G,, and Gr, . 
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NON-ALTERNATING INTERIOR RETRACTING TRANSFORMATIONS 
By G. T. WHyYBuRN 
(Received April 24, 1939) 


1. Introduction 


It is well known that any topological n-cell, and hence certainly any simple 
are, is a retract of any containing topological space; and any topological poly- 
hedron, in particular any simple closed curve, is a retract of some of its suitably 
chosen neighborhoods in any containing space. When anything more than 
continuity is required of the “retracting” transformation, however, the situa- 
tion is obviously markedly altered and yet it seems not to have been the subject 
of much investigation. 

In this paper conditions will be developed under which a locally connected 
continuum M can be retracted onto an arc or a simple closed curve by trans- 
formations of the type indicated in the title. The principal results are: (1) for 
M to be retractable into an arbitrarily given arc axzb by such a transformation 
it is necessary and sufficient that M be identical with the cyclic chain C(a, b) 
in M; and (2) for M to be retractable into a given simple closed curve J by 
such a transformation it is necessary and sufficient that M be cyclic and not 
unicoherent about J. In the proofs, no use will be made of the facts con- 
cerning retracts mentioned above. 

It will be recalled that a continuous transformation f(S) = R is said to be 
retracting (and R is then a retract of S) provided R C S and for each ze R, 
f(z) = x. Also, a continuous transformation 7(A) = B is said to be interior’ 
provided sets open in A map into sets open in B, and T is said to be non- 
alternating’ provided that for no two points z, y ¢ B does T'(z) separate two 
points of 7” *(y) in A. 

Throughout the paper M will denote a compact locally connected continuum. 
For any two points a and b of M, the cyclic chain C(a, b) in M is the set con- 
sisting of a, b, the set K of all points separating a and b in M, and all cyclic 
elements of M containing two points of K + a + b. More simply, C(a, b) 
consists of all simple ares in M of the form azb. 


2. Separation and subdivision 


(2.1) Lemma. Let M = C(a, b), let A and B be disjoint continua in M con- 
taining a and b respectively and let a’xb’ be any arc in M with a’xb’.A = a’, 





1 See Borsuk, Fundamenta Mathematicae, vol. 17(1931) p. 152. 
? Stoilow, Annales Scientifiques de l’ Ecole Normale Supérieure, vol. 63(1928) pp. 347-382. 
* See my paper in American Journal of Mathematics, vol. 56(1934), pp. 294-302. 


914 


a'zb’ «. 


M bet 


then . 
estab! 


where 
and J 
taker 


wher 
is sat 

Le 
subd 
duci 


whel 


wh 


fre 











NON-ALTERNATING RETRACTING TRANSFORMATIONS 915 











a'tb’-B = b'. There exists a set X with X.a'xb' = x which irreducibly separates 
M between A and B into just two components. 

Proor. If « separates A and B in M, we have only to set X = x. If not, 
then M — zx is connected and locally connected. Hence by a result previously 
established’ we can find a decomposition 


M-2z=R+F+R,, 


where R, and R, are disjoint connected open sets containing A + a'r — x 
and B + xb’ — x respectively and where F = F(R.) = F(R) (boundaries 
taken relative to M — x). Thus if we set F + x = X we have 


M-X=R+ kh, 


where F(R,) = F(R.) = X (boundaries taken relative to M), and our lemma 
is satisfied. 

Let the locally connected continuum M be a cyclic chain C(a, 6b). By a 
subdivision” « of M will be meant a finite, linearly ordered set of disjoint irre- 
ducible cuttings of M between a and b 


a= Xo, X1, X2, X3,--+, Xn, Xan = J, 


where X; (1 S ¢ S n) cuts M into two connected sets M.(X;), Mo(X;) so that 
M,(X: Da+ >)" X; and M,(X;) Db + Don X;. The set 


Xia + Xi + Mi(Xi)-Mi(Xin) = 1: 


will be called the internal from X;_; to X; (a = Xo, b = Xnii). A set of the 
form M,(X;)-Mz(X;-1) will be called an open interval of ¢. A subdivision o’ 
will be called a refinement of o provided it is obtained from o by inserting addi- 
tional elements. (The sets X; are called elements of the subdivision.) 

Still assuming M = C(a, b), let azb be any arc in M fromatob. We proceed 
to prove 

(2.2) Lemma. Given any subdivision oo of M each element of which contains 
just one point of axb and any « > 0, there exists a refinement o of oo such that if 
X; is any element of o, then axb-X; is just one point and’ V.(X;) DI; + Tins. 

Proor.’ For convenience we will suppose the metric in M so chosen’ that 
all sets of the form V,(x), x « M, are connected. Let e = ¢/5 and let us cover M 
with a finite number of the sets V.(x), say Vi = V.(11), V2 = Ve(a2), ---, 
Va a VAte). 

Now consider V,. If V; > b we need go no further. Hence we suppose 
Vi-b = 0 and let Xiu: be the first element of o» such that M.(Xin) > v:. 





‘See my paper in Bulletin of the American Mathematical Society, vol. 37(1931) p. 734. 

* Compare with my paper in the Duke Mathematical Journal, vol. 5(1939), pp. 647-655, 
where results very closely related to (2.2) and (3.1) are established. 
, * V(X) denotes the r-neighborhood of X, i.e., the set of all points z at a distance < r 
rom X, 
"See Mazurkiewicz, Fundamenta Mathematicae, vol. 1(1920), p. 27. 
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Now if Voe(1)- Xi: * 0 we need go no further. If this is not so, we proceed 
as follows. Setting a = M.(X;:) + Xi, 8 = Mo(Xins) + Xiz: and letting 2’ 
be a point of V; in the open interval (X; , Xi+1) of oo , we can find® a continuum 
ax’B in M which is a “simple are” between the continua a and 8, i.e., if a and 8 
are shrunk to points we have an ordinary simple are going through z’. 

We shall treat first the case where axzb-V2-(21) = 0. On az’B proceeding 
from x’ toward @ and from 2’ toward 8 let a’ and #’ respectively denote the 
first “points” of the set a + B + azb. 

(i) If a’ # a and Bp’ ¥ 8B, let c be the first and d the second of the points 
a’ and 6’ on azb in the order a, b and let EF = are cz’ of ax’B, F = are x'd of ax’. 

(ii) If a’ ¥ a and p’ = 8B, let c = a’, d = axb-Xis1, E = are cx’ of az’, 
and F = 2’B of az’B. 

(iii) If a’ = a and fp’ $ 8B, let c = axb-X;, d = B’, E = az’ of az’B and 
F = 2’'d of az’B. 

(iv) If a’ = aand pf’ = 8B, let c = axb-X;,d = axb-Xix1, E = az’ of az’s, 
F = 2’B of az’B. 

In any case, let y be the last point of V2.(21) on a’x’B’ in the order a’, B’ and set 


A=a+ac(ofar)+E+ 7, 
B = B + db (of axb) + F — (2’y — y). 


Let x be a point on azb between c and d. Since crd-A = c, crd-B = d, we 
may apply (2.1) and obtain a set X with crd-X = x which separates M irre- 
ducibly between A and B into just two components. 

Now in case azb- V2.(%1) ¥ 0, let d be the last point of V2e(z ) on azb in the 
order a, b and let x’ be chosen as before. We can either join x’ to ad — d by an 
are px’ C M,(Xi4:1) such that px’-axb = p or we can join x’ to X; by an arc pr’ 
such that p e X; and px’-azb = 0. If p € azb, set c = p; if not, set c = X;-azb. 
Let z be between c and d on azb. Then if we set A = a + ac + pz’, B = 
B + db, we can proceed as before to find the set X. 

Clearly X is on the “b side” of Vi, ie., Ma(X) D Vi; and X-V2,(x1) ¥ 0 
since A -Vo.(%1) 0 ¥ B-V2,(%1). In exactly the same manner we construct a 
set Y on the “a side” of V; such that Y-V2(a1) ¥ 0. Let us add X and Y 
to oo and call o; the resulting refinement of ao. 

Similarly for V2 we obtain a refinement o2 of o; which retains the properties 
of o; and in addition contains two elements W and Z each intersecting V2.(z2) 
and such that V2 C M,(W)-M,(Z). Proceeding in this manner to Vm we 
finally obtain a refinement om of oo , which we call o, which retains the essential 
properties of oo and in addition has the property that for any i S m, there 
are two elements X; and X;, of o intersecting V2,(r;) and such that Vi C 
M,(X;)-M.(Xx). Since e = ¢/5 and the sets V; cover M, clearly this is equiva- 
lent to our lemma. 





§ See (1.1) of the paper referred to in °. 
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3. Retractions into arcs 


(3.1) THEOREM. In order that a locally connected continuum M be retractable 
into an arbitrarily given arc axb in M by a non-alternating interior transformation 
itis necessary and sufficient that M be identical with the cyclic chain C(a, b) in M. 

Proor. We first show that the condition is sufficient. By lemma (2.2) we 
can find a subdivision o; of M satisfying the conclusion of this lemma for « = 1. 
Let us set o1 = o; and suppose we have constructed o; and a; for all i < n. 
Then let us choose points 2 , 22, --- 2, on axb such that the diameter of each 
of the arcs Ga, %1%2,--- ,%bis < 1/n. Then by lemma (2.2), we can find a 
refinement o, Of on-1 satisfying the conclusion of that lemma for « = 1/n; 
and by applying lemma (2.1) successively to the points 2, x2 --- 2, we can 
obtain a refinement o,, of o, such that each of the points [x;] belongs to exactly 
one element of o, and such that ¢, still satisfies the conclusion of lemma (2.2) 
fore = 1/n. To see this, consider z;. Suppose 2; lies within the interval 
YZ of co, (if x; belongs to an element of co, , we need make no construction 
forz;). Then axb- YZ is an interval cz,d of axb; and if we set M,(Y) + Y = A, 
MZ) + Z = B and apply lemma (2.1), we get a set X; with azvb-X; = 2; 
which separates M irreducibly between A and B into just two components. 
If we add such a set X; to o, for each 2; not on an element of o, , we obtain 
the required subdivision o, . 

Thus we have set up an infinite monotone sequence o;, o2,--- such that 
for each n, o, satisfies the conditions of lemma (2.2) for « = 1/n and in addition 
such that each interval J = X;Xi4: of o» intersects arb in an arc of diame- 
ter < 1/n. 

We now define a transformation f(p) of M into azb as follows. If p belongs 
to some element X, of o, for some n, let f(p) = azb-X,; if not, then let 
Z, = [[?JI,, where J, is the (minimum) interval of o, containing p and let 
f(p) = axb-Z,. 

Since for each n, J,,-axb is an interval of axb of diameter < 1/n, it follows 
that in either case f(p) is a single point so that f is single valued. Obviously 

f(p) = p for p eazb so that f is retracting. Furthermore, since for any point p 
other than a or b and any e > 0 we can find an open interval FE = (X;, Xx) 
of some o, containing p whose intersection with azb is an open interval (z ;xx) 
of axb of diameter < ¢ and since f(Z) = (2,2), it follows that f is continuous. 
Finally, since for any x eaxb — (a + b), it follows by the construction of the 
subdivisions o, that f‘(x) separates M irreducibly between a and 6 into just 
two components R, and R, which map under f onto the ares az — xz and xb — x 
respectively, and since a = f-'(a), b = f(b), it is readily seen that f is non- 
alternating and interior. 

To show the condition necessary,’ let f(M) = azb be a non-alternating, 
interior and retracting but suppose, contrary to our statement, that there is a 
component R of M — C(a, b). The boundary of FR is a single point z. If 
« = a (or b), then since f is interior, f(R) > azb — aso that R contains apoint 
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of f ‘(b); but since R does not contain 6b, this contradicts the fact that f is non- 
alternating. If a + x + b, then since f is interior, {(R) must contain either 
ax — x or xb — zx; but since R contains neither @ nor b, again it follows that f 
alternates. 

Remark. Attention may be called to the fact that the transformation f 
set up in the sufficiency part of the proof is so defined that f(a) = a,f~'(b) = b. 
Also, for any x eaxb — (a + 6), f (x) separates M irreducibly between a and b 
into just two components. It results from this that in case M is unicoherent, 
f ts necessarily monotone. 

Now suppose azb is a simple arc and g(x) is any interior transformation defined 
on axb. Let M be any locally connected continuum containing azb and such 
that M is identical with the cyclic chain C(a, b). Let f be an interior trans- 
formation retracting M into azb as given by the above theorem. Then the 
transformation g = gf is interior; and since for xeaxb we have f(x) = z and thus 
g(x) = g(x), gis an extension of g(x) toM. Hence we have 

(3.2) ExTENsION THEOREM. Any interior transformation on a simple arc 
axb can be extended (interiorly) to any locally connected continuum M containing 
axb which is of the form M = C(a, b). 

Now let M be any locally connected continuum whatever and let azb be any 
arcin M. Since’ M can be retracted into any A-set in M by a monotone trans- 
formation, there exists a monotone transformation g(x) which retracts M into 
C(a, b). Applying (8.1), we get a non-alternating interior transformation f(y) 
which retracts C(a, b) into azb. The transformation g(x) = fg(x) is then’ 
non-alternating and it retracts M into axb. Since g(x) = f(x) for x e C(a, b) 
and f is interior, it follows that when considered on C(a, b) alone, ¢ is interior. 
Thus we have 

(3.3) THrtoreM. Any locally connected continuum M can be retracted into any 
arc axb in M by a non-alternating transformation g(x) which is interior when 
considered on the cyclic chain C(a, b) alone. 


4, Retractions into simple closed curves 


A locally connected continuum M is said to be unicoherent about a simple 
closed curve J in M* provided that for every decomposition M = H + K where 
H and K are closed sets such that H.-J is an are zry and K-J an arc zsy, the 
points x and y belong to the same component of H.K. 

(4.1) Lemma. If M is cyclic and not unicoherent about the simple closed curve 
J CM, there exist disjoint closed sets X and Y intersecting J in single points x and 
y respectively and disjoint connected regions R and S in M containing the open 
arcs xry and xsy respectively of J and such that M — (X + Y) = R+ Sand 
F(R) = X + Y = F(S). 

Proor. By hypothesis and a theorem of W. A. Wilson’ there exists a decom- 
position M = H + K where H and K are continua such that H-J is an arc 





* See W. A. Wilson, American Journal of Mathematics, vol. 55(1933), pp. 135-145. 
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amb and K-J an are anb. Let U be the component of M — K containing 
amb — (a + b) and let W = M — U. It follows from our hypothesis that 
F(U) = A + B where A and B are disjoint and closed and a ¢ A, b ¢ B. 

Let us decompose U upper semi-continuously” into the sets A, B and indi- 
vidual points of U. Let U’ be the hyperspace of this decomposition and 
w(U) = U’ the associated continuous transformation. Clearly U’ is a locally 
connected continuum. Furthermore, if a = w(A), 8 = w(B), then since M is 
cyclic and neither A not B cuts U (so that a and 8 are non-cut points of U’), 
it follows that U’ is identical with the cyclic chain C(a, 8) in U’. Hence if 
auf denotes the arc w(amb), by (3.1) there exists a non-alternating interior 
transformation g(x) which retracts U’ into auB so that a = g(a), B = g (8) 
and for every uw € auB — (a + B), 9 '(u) separates U’ irreducibly between a and 8 
into just two components. 

Let v be a point of aw — (a + 8) and let V = wg '(v). Since U is connected 
and V is a compact subset of U, there exists a (compact) continuum N with 
VCN CU. Since gw(N) contains neither a nor 8, we can find interior points 
fand 7 on auf so that we have the order a, é, v, 7, 8 and so that the open segment 
Q = tun — (E + 7) of auB contains gw(N). Let 


X=w'g'®, Y= wg") 
R = wg '(Q), S=M—(R+X+/Y). 


These sets satisfy the conditions of our lemma as will now be shown. 

Obviously X-J and Y -J are points z and y respectively and R.J and S.-J are 
the open ares xry and xsy respectively of J. Also F(R) = X + Y = F(S), 
since each of the sets g ‘(¢) and g ‘() separates U’ irreducibly between a and 8 
into just two components and w is topological on U. For the same reason, the 
closure of each component of R must intersect both X and Y. Hence each such 
component must intersect V and thus N. As N C RB it results that R is con- 
nected. Similarly, each component of S must intersect either A or B and hence 
W; and since W is connected, Sis connected. This completes the proof. 

(4.2) THrorem. In order that a locally connected continuum M be retractable 
into a given simple closed curve J © M by a non-alternating interior transformation 
it is necessary and sufficient that M be cyclic and not unicoherent about J. 

We shall first show that the conditions are sufficient. Let X, Y, R and S be 
sets satisfying Lemma (4.1). Let us decompose M into the sets X, Y and 
individual points of M — (X + Y). Call M’ the hyperspace of this decomposi- 
tion and let h(M) = M’ be the associated transformation. Let h(X) = a’, 
KY) = b’, A(J) = J’ = A(ary) + A(xsy) = a’r’b’ + a’s'b’, A(R) = R’, h(S) = 
8S’. Then by (4.1) it follows that &’ is identical with its cyclic chain C(a’, b’) 
and 8’ is identical with its cyclic chain C(a’, b’). Hence by (3.1) there exists 


See R. L. Moore, Transactions of the American Mathematical Society, vol. 27(1925) 
pp. 416-428. 
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non-alternating interior functions ¢g, and ¢, retracting R’ and 8’ into a’r’b’ and 
a’s'b’ respectively so that g;'(a’) = ¢,'(a’) = a’, g, '(b’) = ¢, '(b’) = b’. Thus 
if we let g(x) equal ¢,(x) on R’ and ¢,(x) on S’, then ¢ is a non-alternating interior 
function retracting M’ into J’. 

We now define 


fle) = k[ph(2)]-J. 


(Nore: This equals h”’gh(x) except when ze X + Y.) Clearly f(x) retracts M 
into J. Since ¢h is interior, if G is an open set in M, gh(G) is open in J’; hence 
h~[gh(G)] is open in M, so that the intersection of this set with J is open in J. 
Accordingly f is interior. To show that f is non-alternating, let te J. Then 
f() =h'¢ h(t). If h(t) is either a’ or b’, say a’, theng h(t) = a’ andf ‘(t) = 
X, which does not separate M. If a’ ¥ h(t) ¥ b’, then g “A(é) cuts either R’ or 
S’, say R’, irreducibly between a’ and b’. Thus any component of M — f ‘(t) 
must intersect either X or Y; and since S + X + Y is connected, it follows that 
M —f‘(t)is connected. Thus as no set f ‘(t) separates M, f is non-alternating. 

To prove the conditions necessary, let {(M) = J be non-alternating, interior 
and retracting, where M is a locally connected continuum and J is a simple 
closed curve in M. Let C(.J) be the cyclic element of M containing J. Then 
M = C(J). For if not, there exists a component R of M — C(J); and if u 
denotes the boundary of R and vis a point of R — R-f 'f(u), f(u) = 2, flv) = y, 
then f ‘(x) would separate the points y and v of f(y) in M contrary to the non- 
alternating property of f. Hence M is cyclic. To see that M is not unicoherent 
about J, we have only to take points a and b on J decomposing J into arcs 
axb and ayb and set H = f ‘(axb), K = f‘(ayb). This gives M = H + K, 
H.J = axb, K-J = ayb; and since f(H-K) = a + b, obviously a and 6 lie in 
different components of H.K. 

It will be noted that the transformation f(x) set up in the sufficiency part of 
the proof just given is so constructed as to have the following additional prop- 
erties: 

(4.21) There exist open intervals a and B of J such that if x ea, y €B, f(z) + 
f~"(y) separates M irreducibly into just two components. 

(4.22) If p'(M) = 1, f is monotone (or if p'(M) = k, then for each x € If (@ 
has at most k components). 

The second of these is an immediate consequence of the first. 

The same device as used above to obtain (3.2) yields 

(4.3) ExTeNsion THEOREM. Any interior transformation on a simple closed 
curve J can be extended (interiorly) to any cyclic locally connected continuum con- 
taining J which is not unicoherent about J. 

Also the method of obtaining (3.3) from (3.1) when applied to (4.2) yields 

(4.4) THrorem. If M is any locally connected continuum and J is a simple 
closed curve in M about which M is not unicoherent, M can be retracted into J by a 





11 p'\(M) denotes the 1-dimensional connectivity number of M. 
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non-alternating transformation which is interior when considered only on the cyclic 
element C(J) of M containing J. 

In conclusion we remark that since by a result of Wilson,” if a locally con- 
nected continuum M is multicoherent, it contains some simple closed curve 
about which it is not unicoherent, we have that Jn order for M to be retractable 
into a simple closed curve by a non-alternating interior transformation it 1s necessary 
and sufficient that M be cyclic and multicoherent. 
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ON A STATIONARY SYSTEM WITH SPHERICAL SYMMETRY 
CONSISTING OF MANY GRAVITATING MASSES 


By ALBERT EINSTEIN 
(Received May 10, 1939) 


If one considers Schwarzschild’s solution of the static gravitational field of 
spherical symmetry 


M 
" ae 
(1) ds’ = -(1 + x) (da? + da + dx}) + _* dt’ 
2r 1+ “ 
2r 
it is noted that 
_#\’ 
gu = — 
+2 


vanishes for r = 4/2. This means that a clock kept at this place would go at 
the rate zero. Further it is easy to show that both light rays and material 
particles take an infinitely long time (measured in “coérdinate time’’) in order 
to reach the point r = y»/2 when originating from a point r > u/2. In this 
sense the sphere r = u/2 constitutes a place where the field is singular. (u repre- 
sents the gravitating mass.) 

There arises the question whether it is possible to build up a field containing 
such singularities with the help of actual gravitating masses, or whether such 
regions with vanishing gs do not exist in cases which have physical reality. 
Schwarzschild himself investigated the gravitational field which is produced by 
an incompressible liquid. He found that in this case, too, there appears a 
region with vanishing gu if only, with given density of the liquid, the radius of 
the field-producing sphere is chosen large enough. 

This argument, however, is not convincing; the concept of an incompressible 
liquid is not compatible with relativity theory as elastic waves would have to 
travel with infinite velocity. It would be necessary, therefore, to introduce a 
compressible liquid whose equation of state excludes the possibility of sound 
signals with a speed in excess of the velocity of light. But the treatment of any 
such problem would be quite involved; besides, the choice of such an equation 
of state would be arbitrary within wide limits, and one could not be sure that 
thereby no assumptions have been made which contain physical impossibilities. 

One is thus led to ask whether matter cannot be introduced in such a way 
that questionable assumptions are excluded from the very beginning. In fact 
this can be done by choosing, as the field-producing mass, a great number of 

922 


smal 
prod 
clust 
mov 
met! 

W 
all y 
clust 
dist! 
that 
mas 
alw: 








ON STATIONARY SYSTEM WITH SPHERICAL SYMMETRY 923 









small gravitating particles which move freely under the influence of the field 
produced by all of them together. This is a system resembling a spherical star 
cluster. Hereby we may proceed as if the field, in which the particles are 
moving, were produced by a continuous mass distribution of spherical sym- 
metry, corresponding to the whole of the particles. 

We can further simplify our considerations by the special assumption that 
all particles move along circular paths around the center of symmetry of the 
cluster. Even in this case it is still possible to choose arbitrarily the radial 
distribution of mass density. The result of the following consideration will be 
that it is impossible to make gi zero anywhere, and that the total gravitating 
mass which may be produced by distributing particles within a given radius, 
always remains below a certain bound. 







of 







1. On the paths of the particles and their spacial distribution 
By a suitable choice of the radial coérdinate, it is possible to obtain the 
gravitational field of the cluster of spherical symmetry in the form 
(2) ds’ = —a(dx} + dx} + dx3) + bd, 


whereby a and b are functions of r = (zj + 23 + a3)’. First we shall investigate 
the circular motion of one particle around the center of symmetry. Suppose, 
for instance, this motion takes place within the plane z; = 0. Through the 
introduction of polar coédrdinates 









v3 = r cos v, 





2, = rsin # cos ¢, 





te =rsind sin ¢g, 





— ws 


(2) assumes the form 





(2a) ds’ = —aldr® + r°(dd* + sin’ 3 dg*)} + bd?’ 
The field is characterized by 

gu = —4, 9x3 = —ar’ sin’ 8, 

gz = —ar, gu=b, 


where all the rest of the’g,, vanish. The particle under consideration satisfies 


the equation 
d’ x dx_ dxg 
3 P Be ype ole ws 0 
8) ds* °ds ds 
In addition its motion is determined by the conditions 
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It turns out that (3) is satisfied when 


1 dzz dz 1 dz, dz, 


Vs 7 a + Pua ase = 0, 
or when 
2 de F 
Because of (2a), we have 
ds , yA 2 a) 


Thus, dg/dit and ds/dt are determined when the field is given. 
Because ds” has to be positive for the world line of a particle in motion we have 


ds Fs 2 a) = 2 b’ 
(*) =b—ar (% = b —ar (ariy’ > 0, 





or 
b’ 
b 

(6) 1— Carty > 0. 
ar? 


By applying this condition to Schwarzschild’s field (1) we obtain 
(6a) r>5@+ v3). 


It follows that in the case of a Schwarzschild field a particle is bound to follow 
a path with a radius greater than (2 + +/3) times the radius of the Schwarzschild 
singularity. This fact has the greatest significance for the following investiga- 
tion: In the outermost layer of our particle cluster (and beyond it) the gravita- 
tional field is given by (1). It follows that the total gravitating mass of the 
cluster determines a lower limit for the radius of the cluster; this radius is (in 
coérdinate measure) more than (2 + +/3) times greater than the radius of the 
Schwarzschild singularity as defined by the field in the empty space outside 
the cluster. 

The normal to the plane in which the particle considered moves has the 
direction of x3. If it is assumed that the normals to an infinite number of such 
planes are distributed at random and also that the phase angles of the paths are 
subject to a random distribution, then we obtain a cluster of particles of spherical 
symmetry whose paths have the radius r. The most general cluster to be 
considered by us consists of an infinite number of clusters of this special type 
which belong to all values of r. (More accurately speaking, the whole cluster 
consists, of course, of a finite number of particles so that a field is created which 
only approximates spherical symmetry.) 
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In order to formulate the conditions of dynamical equilibrium of the cluster 
under the influence of its own gravitational field, we first have to compute the 
energy tensor belonging to such a cluster. For this purpose we assume, for the 
sake of simplicity that all particles have the same mass m. 


2. The Matter-Energy Tensor of the Cluster 


We consider the motion of particles within a volume element on the z3-axis. 
The velocity vectors all have the same amount, they are perpendicular on the 
z-direction, and they are evenly distributed with respect to the directions 
within the 2, %- plane. We know further that the matter-energy tensor 
depends also on the particle density and on the gravitational potentials, but 
not on the derivatives of the latter. It is, therefore, possible to determine this 
tensor by a straightforward calculation. 

First we consider particles, with the mass m and the particle density m per 
unit volume, at rest with respect to a codérdinate system of the theory of re- 
stricted relativity. In such a case of the energy tensor only the (44)-com- 
ponent exists, 

{om day 
T“ = mn ‘= a 
With respect to codrdinate systems in relative motion in the z-direction we 
have the components 


” pahh dz dz; Ay dx, dx, 
T= OT a ts ae 
14 dx; dx, 
| = mn ds a ° 


The particle density n with respect to such a system is determined by the 
equations: 

mVo = nV, Vods = V dt, 
where Vp and V denote the rest volume and the coérdinate volume respectively. 
Therefore we have 


y= 
= dz, 
We now consider the case when the velocity vector of the particle makes an angle 


a with respect to the 2;-axis, and is perpendicular to the Tyraxis. By using the 
relations derived above and by introducing dl = dz? + dx} , we obtain 


eet ds {dl 2 tel ds (2) . 

T = mn =(%) Cos’ a, TT” = mn dz, \ds cos @ SiN a, 
22 ds (dl\’ . » “ea ds dl dx 

T = mn dit, (2) sm a, T" = mn da. de ds COs a, 
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TH = mn $¢ (He) a mn 1 OM ain a, 
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all the other components of the energy tensor being zero. In the case that the 
velocity vectors are evenly distributed over all values of a the result is 


ds {dl 
= 2_1 —s oid 
™ «Tf gmn dz, ( 2) Tu T 22, 


T = mn du _ 7, 


ds 
We now proceed to the case that the components of the metric tensor are gy, = 
Jo2 = Ys3 = —aand gy = b. The components of the energy tensor are obtained 


by applying the transformation law for tensors and by transforming the co- 
ordinates according to 


dz, = a'dz, 
dx, = b' dz, ° 
We obtain 
Ty = (2) Tu = aT un, 
Tu = (=) T4 = bT a. 


dl and dz,, contained in 7, and T'4, are to be replaced dl by a’ dl and da, by b*da,. 
Further we have to introduce the particle density with respect to the new co- 
ordinates, 7, according to 
ndx,d2x_dxz3 = Nd, di_dzz 
or 
n= na}, 


After having made all these transformations and substitutions, and omitting 
the bars denoting the new coérdinate system, we obtain 


ds {dl 
TT, =Ta= a 
ul 22 Amna? b Pm ‘( o). 


(7) 
—3 py 2X4 
b 7s’ 


In these equations ds/dx, and dl/ds have to be replaced by the expressions given 
by (4) and (5) which were derived from the equations of the geodesic lines. 
Further we write dt instead of dz, and rdg instead of dl. The final result is 


’ 4 
Tu = Tx = anna? (— ne ), 


— p’ 
6 Tu = mna ( ) 
b a’ — p’ , 


Tu = mna 
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where a and 6 denote the expressions 
a = In (r’a), 


(7b) B = Inb. 


3. The Differential Equations of the Gravitational Field 
The differential equation of a gravitational field which is due to a matter- 

energy tensor are 

(8) Gw = Rw — 39R + «Ty = 0. 

These equations have to be specialized for a static field of the type (2). By a 

straight forward calculation the following equations are obtained for a point on 

the 2s-axis: 


oe oe ter. ley 
®) ~G0 = 4+ +3 (2) +525 = 
__ lfvy 1/vhY 1a 10 1/b’\ o 
We (2) 5(5) - 35-35 -a(g) toto 
"\t ’ \2 
(11) ‘Gu = (£) +9%..4:5 =) + Tut = 0. 
b a ra s44\a b 


For 7, and T'4 we have to substitute the expressions given by (7a), (7b). Asm 
is to be considered a given constant, the only functions of the coérdinates in 
these equations are n, a, and b. It is to be expected in the first place that n, 
ie. radial distribution of matter, remains undetermined by the equations. 
This makes necessary the existence of an identity between the equations (9), 
(10), (11). In fact such an identity exists. Its form is 


/ , , 
(12) oma + (2438) Gu (724+ 2) Gut 3F Gu. 
es r a 2b 

It may be obtained in the following way: We have constructed 7',,, by consider- 
ing particles which satisfy the equations of motion in the field. Therefore the 
covariant divergence of this tensor is bound to vanish identically. On the other 
hand, the divergence of R,, — 49,2 vanishes identically on account of the 
Bianchi identities. Of these four equations having the form of divergences only 
the one with the index 3 yields anything which does not already vanish identi- 
cally with respect to the G,, , and that is (12). From the form of (12) it follows 
that (10) is the consequence of (9) and (11). The problem is therefore reduced 
to (9) and (11), and the particle density remains undetermined, as was to be 
expected. 

This result makes possible a further simplification of the problem. If, in (9), 
the quantities a = In (r’a) and 6 = In b are introduced, we obtain the equation 


(13) _ a + a’p’ = 0. 
; fF 2 
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By taking into account (13) and (7a), we obtain from (11) 


(14) oye yl e_° dee Bh. teh me 
¥ r | 4° op 3 we 2 ‘ 
2 r 
This is a differential equation foraalone. When ais already known 6 is obtained 


by a simple integration from 


Se es 
(13a) B -1(2-32"). 


4. Localization of the Particles within a Thin Spherical Shell 


Outside the cluster, the gravitational field is represented by Schwarzschild’s 
solution which, with our choice of the coérdinate system, is given by (1). Inside 
the cluster, the field is determined by (14). Thereby, the function n is to be 
considered as given. However, nis not completely arbitrary, as the total radius 
of the cluster is restricted by the lower limit given by (6a). 

Equation (14) represents a complicated relation between the particle density 
n and the function a representing the gravitational field. The limiting case, 
however, in which the gravitating particles are concentrated within an infinitely 
thin spherical shell, between r = ro — A and r = 179, is comparatively simple. 
Of course, this case could only be realized if the individual particles had the 
rest-volume zero, which cannot be the case. This idealization, however, still is 
of interest as a limiting case for the radial distribution of the particles. 

We divide the whole space into three zones for separate consideration, part O 
to be the part outside the shell, r = 7, part J to be the part inside the shell, 
r S ro — A, and part S to be the part of the shell —A SrZv7. In QO, the 
gravitational field is represented by (1), in J, it is represented by (2) with 
constant values of a and b. It follows that a’ (and a’) have to change within S 
the faster the smaller A is chosen. However, as a’ remains finite in S, a itself 
changes only infinitely little in S. It is, therefore, permissible in S to neglect a’ 
compared with a’’. We therefore replace (14) within S by 


of. 2 
oe 
2 a 
where a and r are to be treated as constants for integration purposes. We 
introduce the variable 


(14a) a’ + xkmna— = 0, 


and the “constant” 
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1 
= 


zis hereby determined as a function of r within S if n is given as a function of r. a) 
When the integration is carried out between rp — A and 79 we obtain aD || 













and obtain the equation 








(14b) ) dz = Cndr. 
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(15) |2 — arctgz |7°* = a N=-~vV2a? = i 
4nro Sar To i | 
where N designates the number of particles in S. It follows from (1) that for ip 





fr = 10 Hy | 















— 40 + 0°)! hil 
and from (2) that, because of a and b being constant in J, in J | ne 
(15b) Zry—-a = V2. i 
It follows from (6a) that d | ie 
oe et a tn al iy | 
2+ V3 | 





It turns out that this is just the condition for the numerator of the expression 
for z,, to be real. (15), for each possible 7», gives the relationship between the 
sum of the masses of the particles, mN, and the total gravitating mass u of the 
cluster. For large values of 7) , with a fixed value of u, one obtains in the limit 











K 
y= 2 mN. 





(16) 






The factor x/8x is due to the fact that m is measured in grams, u, however, in 
gravitational units. (16) therefore simply states that in this limiting case the 
gravitating mass of the cluster is equal to the sum of the particle masses. Tan 
The most illuminating way to express this result is the following: | 
Outside the shell (r = 70), the gravitational field is given by 
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ds* = — (1 4. 4) (dxi + dx: + dx3) + - or af 
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Inside the shell it is given by the same expression, with the difference, however, 
that r is to be replaced by the constant 7> , whereby the inequality 





rm > 5 (2+ v3) 
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must be satisfied. The number N of particles of the mass m which together form 
the shell is given by the following consideration: As an abbreviation we introduce 


at “3 el 
Then we have 
oe [(W2 — arctg V2) — (z,, — arctg z,,)](1 + 0)? 








z= $(c) /8 ’ 
where 
at (1 — 40 + o°)! 
in, = V2 1 + 7 ‘ 
o can assume values between 0 and 2 — »/3(~-27). The quantity 
z2—o 





o 


is only very little different from zero in this whole region. A few typical values 
are given in the following table: 











z2—ao 
a 
o 

.05 .042 
14 .06 
2 .055 
23 .013 
27 — 0.022 





This leads to a very interesting consequence: First it is clear that (2 — o)/o 
may be replaced by (2 — o)/2Z with good approximation and this by (M — y)/M. 
This latter quantity is the relative decrease of energy of the cluster when it 
contracts from an infinite radius to the radius 7». The table shows that this 
contraction energy has a maximum near o = 0.15, and for greater values of c, 
i.e. smaller values of ro , it decreases again. The physical cause of this effect is 
that, with decreasing 7, the potential energy of the cluster decreases, but the 
kinetic energy increases. For sufficiently small values of rp the latter effect 
surpasses the former. 

It is therefore clear that the decrease of the radius with decreasing energy 
would come to an end for a value of about o = 0.15, i.e. a radius of about 
6.7(u/2ro), while the lower limit of the radius as given by the velocity of light is 
(2 + V3) (u/2ro). The value of r corresponding to the minimum energy means 
an upper limit for the particle velocity in the direction of the tangent of about 
0.65 times the light velocity. 


5. Qualitative Discussion of the Case of Arbitrary Radial Mass Distribution 


We consider the case of a given mass uw and a shell radius 7 satisfying the 
inequality (6a). When a number N of particles is brought into this shell zone, 








ON STATIONARY SYSTEM WITH SPHERICAL SYMMETRY 931 










as determined by (15), then the exterior gravitational field is just completely 
screened off from the interior J so that there the fieid will be Euclidean. This 
means that the line element in J is characterized by constant values of a and b, 
where 6 cannot reach its lower limit 1/+/3. 

If, however, the number of particles in S is chosen smaller than according to 
(15) then the field will not be screened off entirely (u is hereby regarded as being 
kept fixed). We can then satisfy the theory formally by replacing the Euclidean 
line element in J by a Schwarzschild line element of the form 


m\* 

4 an 
a= A(i+%), b=B i 
2r 14 

2r 
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where A, B, and yw; are constants. , will be smaller than » which characterizes it; 
the field outside the shell. This interior field has a singularity of the Schwarz- 
schild type (b = 0) atr = m;/2. 
This singularity, however, can be removed by introducing a second shell S; | 
inside S, which has to be constructed so that the gravitational field in its interior i 
will be Euclidean. The whole cluster will then consist of two shells S and 8S, 
and will have no Schwarzschild singularity. i 
Again this system can be modified by reducing the number of particles in S; 
so that it will not screen off its exterior field (between S and S,) entirely; then a if 
third shell S. , of still smaller radius, may be constructed so that its exterior field 1 
is just screened off entirely from its interior. \ 
This method can be reiterated up to the center of the cluster. Thus one We 
obtains clusters with the most varied radial mass distributions. There will be i ii 
also various steady distributions. It is impossible, however, that b should 
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3 vanish anywhere. The radius of the cluster will always be greater than the | 

t limiting radius $u(2 + +/3), and it will not be possible to concentrate the matter } H\| 

a of the cluster arbitrarily densely near the center of the cluster. 

6. The Case of Continuous Particle Density iH 
vel} 

. The consideration given in part 5. leads toward the solution for continuous | H | 

distributions of the particle density. We divide the interval 0 S r S 7% into an Ht) | 





infinite number of equal parts dr. We imagine that there is constructed in the 
center of each partition dr a shell of a two dimensional character of the type 
discussed in part 4. The shells may be chosen so that they are equivalent to a 
continuous distribution of mass. Between any two subsequent shells we shall 
have a gravitational field of the Schwarzschild type 














1--\* 
4 
(17) ds’ = —A (1 + z) (dx? + dz? + dz) +B —* dt’, 
ea 
2r 





where A, B, and 7 are constants which differ only infinitesimally for two neigh- ih | 
boring regions. Then the sum total of all these partial solutions constitutes the hi 
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gravitational field inside the cluster. Our task is to determine A, B, and 7 as 
functions of r. 

We consider two neighboring Schwarzschild solutions which belong to the 
radius intervals r — $dr tor + 3drandr + 4drtor+ dr. In the first region 
the values of A, B, and 7 belong to the value r of the radius, in the second to the 
value r + dr. If we use the quantities introduced by (2) then the two local 
solutions are given by 


a(r; A, 7), a(r; A + dA, r + dr), 
and 
b(r; B, 7), b(r; B + dB, + + dr), 


where a, b are functions of r in accordance with (17). These two solutions are to 
assume the same values for a and b in the point r + $dr because these quantities 
must not change when we pass through a shell occupied by particles. It follows, 
up to quantities of the first order 


or dA + Sar = 0, 


ab ab 
or, in accordance with (17) 


dA, 4rdo + odr _ 
A r i+e 
qB 4 rdo+odr 
B r(1+o)(1 — oa) 


where o is written for 7/2r. 

These equations determine A, B as functions of r when 7 or a is given as func- 
tion of r. It turns out that a, 8, computed from the solutions A, B of (18), are 
the solutions of (13), represented with the help of the “parameter” function co. 
7 is arbitrary within certain limits because it is closely connected with the mass 
distribution. On the other hand, A, B, and 7 have to satisfy the condition that 
(17) makes possible circular particle paths for all values of r, i.e. a and b have to 
satisfy the inequality (6). In connection with (17) we obtain the inequality 


eo) ae 
(19) 1 aa “a + 0 —9), 
f+ 244 7 


0, 
(18) 





= 0, 








ee 


(18) and (19) together completely determine the problem within the cluster; 
@ is arbitrary save for the only restriction that, together with the values of A and 
B, calculated from (18), it has to satisfy (19). 
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For r 2 7 we have, of course, A = B = 1, with r = const. = u. 
By using (18) we may write (19) thus: 





4 o 
1- (i+o(l-o) J 9 


o 
siete x Y- 











or, with some transformations: 


(oc — 2+ V8)(o — 2 — v2) > 0. 
(1 — a)? 


This inequality has to hold within as well as outside the cluster. For infinite 
values of r, o vanishes. Further o has to be positive, as negative masses are 
excluded. Because of the denominator, « can nowhere be greater than 1. 
Therefore the numerator of the left hand side has to be positive. As the second 
factor of the numerator is always negative the first factor has to be negative, too. 
We therefore obtain 


(19b) ¢<2- V3. 


This is a generalization of (6a) as (6a) was only proven to hold for the outside 
boundary of the cluster. 

7 represents the mass enclosed by the spherical surface of the radius r. In 
order that negative masses should be ruled out it is necessary that everywhere 


dr 
po = 0. 
It is further necessary that 7 vanishes for r = 0. Save for this condition + 
may be chosen arbitrarily if only o satisfies (19b). When 7 and therefore oa is 
given then the problem of determining the gravitational field of the form (17) 
is reduced to the carrying out of two integrations, according to (18). 

The equations (18) give us the integration of (13) with arbitrary mass density 
distribution, where the latter is expressed by 7 or ¢. (14) gives the correspond- 
ign particle density n. We shall express n in terms of ¢. We have 





(19a) 


(20) 





2/1-c\’ 4 oa ne UR 
21 eee ae ee 
(21) " 2(}-~) ait V1 — 40+ 0? 
together with the relations 
A’ 4ro’ +o 
2 = 4 — io. 
( 2) a A(l + a) ’ A rl + a 


Therefore, when o is given as a function of r we obtain n by carrying out one 
integration only. 

¢ is positive and stays below the limit 2 — 1/3. The square root of the 
denominator of the third term in (21) therefore is always positive. We further 
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have 7/2r where 7 is the gravitating mass contained in a sphere of the radius r. 
7 therefore increases monotonically with increasing r. If the mass density is 
to be finite in the region around r = 0 then 7 has to decrease in that region at 
least as fast as r° and o at least as fast as r’. Under these conditions the two 
first terms in (21) will be finite everywhere, and also A’/A, A, and a. (21) 
therefore gives us a finite value for n. It is further possible to prove from the 
properties of 7 that the sum of the two first terms in (21) is negative everywhere. 

From all these considerations it can be followed that a and b are finite and not 
zero in the whole space. 

By combining (2), (4), (17), and (18) one can show that the ratio V between 
the particle velocity and between the light velocity pointing into the same 
direction, is given by 


(23) ya a 


a’ (1—o)* 





When o stays below a given limit V will stay below a certain limit, too. 


7. A Special Case of Continuous Mass Distribution 


It is of some interest to investigate the case where o inside the cluster is a 
constant oo. Strictly speaking this case falls outside of our conditions as o 
ought to decrease toward the point r = 0 at least as fast as r’ in order that the 
density in the neighborhood of the center should stay finite. We can satisfy 
this condition by choosing o for instance 


(24) og = o(1 — oe 


where c is to be an arbitrary constant. We then consider from the start the 
limiting case of c = «. This special case is discussed here in order to supple- 
ment the discussions of part 4. There the whole mass was distributed as far 
outside (within the total radius 79) as possible, while here we have a strong con- 
centration of mass toward the center of the cluster. 

As 7 is the gravitating mass enclosed by a spherical surface of the radius r, 
dr/(4nr’dr) is the mean density of the gravitating mass in the point r. As 
t = 2oor we obtain for this mean density 00/277’, i.e. a radial decrease of the den- 
sity like 1/r’ up to the cluster boundary r = 7%» . 

From (18), in accordance with (24) (in the limiting case of vanishing expo 
nential term), we obtain 
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and since for r = 7, A and B have to assume the value 1 


r —4¢9/(1+09) 
A=(2 

To 

r 4¢9/ (1-03) 
B= {— , 

To 


For r = 0 we obtaina = ~ andb = 0. This type of singularity, however, is 
not to be taken seriously because it would be avoided if we had taken into con- 
sideration the exponential term in (24). It is to be noted that through a suitable 
choice of the mass distribution this singularity can be approximated, but not 
reached. 

We make use of (21) in order to determine the relation existing between the 
sum of the rest masses of the particles M 








(18b) 














K "° 2 
M= Em | n-4xr° dr, 
8r 0 






and the total gravitating mass of the cluster u. It can be shown that the first 
term of (21) gives only a vanishing contribution for infinitely great values of c. 
l-@ 


This follows from the fact that ( 
l+o 


of the exponential term of (24) has become unnoticeable. We compute the 
contribution of the second term in (21) by omitting the exponential term from 
the start and obtain, after a short calculation, as the final result, with 4» = 2roco 


1 + oo 


(1 oo a)? 





, 
) vanishes everywhere where the influence 












(25) M = wll — 400 + 0)? 






This equation when compared with the relation 


(26) 





LL = 2Qo0r0 










allows an easy discussion of the essential properties of clusters of this type. d ! 

First it is easy to see that we have extremely simple relations when we change Ae | 
M but keep fixed oo (0 < oo < 2 — +/3) and thereby the tangential velocity of ‘Wy 
the particles as measured in light velocity units. When M is multiplied by z i 
the gravitating mass will be zu and the diameter of the cluster will be z-2r. 
The mean density will be multiplied by 1/2’. 

In order to obtain a survey of all possibilities it is therefore sufficient to keep 
fixed the number of constituting particles and thereby M and to vary oo together 
with the diameter 2r) and the gravitating mass. We obtain for M = 1 


_ (1 = 0)” 
- bee 












(1 — 409 + 05). 
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The following table gives » and 279 for M = 1 as functions of oo (approximately): 














d0 Le 2ro 
0. | a oo 

.05 .988 19.76 
p | .948 9.48 
15 .97 6.56 
2 1.13 5.65 
.23 1.32 5.63 
25 1.82 7.40 
.26 2.63 10.1 
.268 oo oo 


When the cluster is contracted from an infinite diameter its mass decreases at 
the most about 5%. This minimal mass will be reached when the diameter 27) 
is about 9. The diameter can be further reduced down to about 5.6, but only by 
adding enormous amounts of energy. It is not possible to compress the cluster 
any more while preserving the chosen mass distribution. A further addition 
of energy enlarges the diameter again. In this way the energy content, i.e. the 
gravitating mass of the cluster, can be increased arbitrarily without destroying 
the cluster. To each possible diameter there belong two clusters (when the 
number of particles is given) which differ with respect to the particle velocity. 

Of course, these paradoxical results are not represented by anything in physi- 
cal nature. Only that branch belonging to smaller oo values contains the cases 
bearing some resemblance to real stars, and this branch only for diameter values 
between « and 9M. 

The case of the cluster of the shell type, discussed earlier in this paper, behaves 
quite similarly to this one, despite the different mass distribution. The shell 
type cluster, however, does not contain a case with infinite u, given a finite M. 


The essential result of this investigation is a clear understanding as to why the 
“Schwarzschild singularities” do not exist in physical reality. Although the 
theory given here treats only clusters whose particles move along circular paths 
it does not seem to be subject to reasonable doubt that more general cases will 
have analogous results. The “Schwarzschild singularity’ does not appear for 
the reason that matter cannot be concentrated arbitrarily. And this is due to 
the fact that otherwise the constituting particles would reach the velocity of 
light. 

This investigation arose out of discussions the author conducted with Professor 
H. P. Robertson and with Drs. V. Bargmann and P. Bergmann on the mathe- 
matical and physical significance of the Schwarzschild singularity. The problem 
quite naturally leads to the question, answered by this paper in the negative, 
as to whether physical models are capable of exhibiting such a singularity. 
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1. The Spinor Formalism 





In this paper the two component spinor formalism will be used to study the 
effect of geometric spin transformations corresponding to both proper and im- 
proper Lorentz transformations on the tensor equations which Whittaker’ and 
Ruse’ have shown to be equivalent to the Dirac equations. The spinor form of 
the Maxwell equations given by Laporte and Uhlenbeck’ is obtained with the 
aid of the formalism. These equations are compared with the equations pro- 
posed by Dirac* and used by Kemmer”’ for material particles with spin one. 
The tensor equations equivalent to these are obtained and are shown to differ 
from the equations proposed by Proca.® Throughout this paper we shall use the 
notation developed by O. Veblen in a seminar conducted by him and W. Givens.’ 

The Dirac equations when written in two component form are 




















: A 
= —imce’, 





(1.1) 





oA h fe] é ees <A 
g (#2 +2en)e umey’ , 





where m and e are the mass and charge of the electron, h is Planck’s constant 
divided by 2z, c is the velocity of light, y, the electromagnetic four vector poten- 
tial, and y*, »*, andg e4arespinors. The components of the spinor g°4» depend 
on the codrdinate system in two spaces, namely, the spin space and the space of 















1E. T. Whittaker: ‘‘On the Relations of the Tensor-Calculus to the Spinor-Calculus,”’ 
Proc. Roy. Soc. 158A, pp. 38-46 (1937). 

2H. S. Ruse: ‘On the Geometry of Dirac’s Equations and their Expression in Tensor 
Form,’’ Proc. Roy. Soc. of Edin., LVII, part II, pp. 97-127 (1936-37). 

’0. Laporte and G. E. Uhlenbeck: “Application of Spinor Analysis to the Maxwell 
and Dirac Equations,’’ Phys. Rev. 37, p. 1380 (1931). 

*P. A.M. Dirac: ‘Relativistic Wave Equations,’’ Proc. Roy. Soc. A 155, p. 447 (1936). 

’N. Kemmer: ‘“‘Quantum Theory of Einstein-Bose Particles and Nuclear Interaction,” 
Proc. Roy. Soc. A 166, p. 127 (1937). 

® A. Proca: “Sur La Theorie Ondulatoire Des Elections Positifs et Negatifs,’’ J. de 
Physique et de Radium 7, p. 347 (1936). 

70. Veblen and J. Von Neumann: ‘‘Geometry of Complex Domains,’’ Princeton Mimeo- 
graphed notes (1935-36). Notes by W. Givens and A. H. Taub. References to these 
notes will be denoted by ‘‘notes.”’ 
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special relativity S, with codrdinates z’(x* = ct) and a metric tensor ge; which 
in a preferred coérdinate system has the components 


a) eo es 
| 0 =—1 0° Oo 
0 0-1 Of, 
0 0 oO +1 


(1.2) Il gor || = 














The spinor "a is related to the tensor g,, by the matrix equations 



































(1.3) 2(9°9° + 9'9°) = —9"1, 
where g” is determined by gz; in the usual manner, | is the identity matrix and 
g’ = || g°*n || and the bar denotes the complex conjugate. A solution of these 
equations is given by the matrices 
oo - ee | 1 0 
(1.4) of =f = 1 0|)’ = a= 0 ele 
eis me lag ‘| as -|° a 
y 1 | 
If y = | y and g = Ol equations (1.1) may be written as 
"pay = —imep 
(1.5) : ary 
J Pop = —imey, 
where p, = : < _ “ge. From this form of the equations it is readily verified by 


using (1.3) that both g and y satisfy second order wave equations. 

The tensor g,, and the tensor g”’ may be used to raise and lower tensor indices. 
The spinor indices are raised and lowered by antisymmetric spinors €4, and e*” 
of weight —1 and +1 respectively. In all coérdinate systems they have the 
components 


0 1 
(1.6) lel = He*l=|_9 ol) 
The rules for raising and lowering spin indices are given by the equations 
(1.7) Ve = easy” or h=V, b= -¥, 
and 
(1.8) y= yn. 


Since the spinors e4, and #4 are antisymmetric, the rules for raising are different 
from those for lowering an index. The rules given by equations (1.6) and (1.7) 
are consistent. They can be used on spinors with a number of indices. Dotted 
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indices are manipulated by spinors eis = é4, and e“* = ¢“*. Asa consequence 
of our rules we have 

(1.9) V4 = V4eang = —g4~,, 

hence 

(1.10) Vv. = 0 


If we apply the process of raising and lowering indices to the spinor gz‘z, 
we obtain two new spinors, g.“” and geiz. In the codrdinate systems in which 
(1.4) hold we have: 














oust |g], Howell =|) “oh, 
(1.11) 10 4 — 
lawell=|)° 5] and llouell=|5 yf, 
Wot=|~9 gf lott=]> ol, 
(1.12) | 
leet = | Gf and lot = {5 of: 














From these equations it is readily verified that 
(1.13) JoAB = Joba and a = a. 


that is, each of the matrices || g.4” || and || ges || is Hermitian. Equations 
(1.13) are spinor equations and hence hold in all coérdinate systems, since we 
assume that the spinor g’4” has weight and anti-weight equal to } and all 
contravariant simple spinors are of weight +4 and covariant ones of weight — 3. 
(See Chapter II §3 of notes.) 

Equations (1.3) may be written as 


(1.14) 4G" grac + 9:"" 9" ac) = 8780. 

By setting B = C and summing, we obtain 

(1.15) 9°" grin = 28%. 

Another identity satisfied by the components of the spinor 974" is 
(1.16) 9" geen = 26686, 


as may be verified by using the components of the spinor in the particular 
coérdinate systems. Equations (1.15) and (1.16) are the analytical conse- 
quences of the fact that there is a (1-1) correspondence between Hermitian 
second order matrices and points of S, and may be derived from that corre- 
spondence. 

The (1-1) correspondence between self dual antisymmetric tensors in S, and 
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symmetric spinors may be studied by using the spinor Ser‘ c defined by the ( 
equations , 
(1.17) Sorc = 1(g,°* grace es g:°* gen) = 4(9,45g0°c — a“sa"d. ( 
From equation (1.17) it follows that 


(1.18) Serta = 0 or Soran = Serza- 
From equation (1.14) and (1.17) we find 
(1.19) goo" grec - Sorc + GJerdc. 


By direct application of equation (1.16) we obtain the following useful 
equations: 


(1.20) S74? Sercv = 4(5¢6p + 8p6c), 
(1.21) B77" Sercv = 0, 

(1.22) 9°” Soran = —049r'B — OB Gr A, 
and 


(1.23) S" AB | ad = Mo", ai” e + i". gs*s + Pt.’ + rae gh 


From the second of equations (1.17) and (1.4) we find that in the special 
coérdinate systems 























—1 0 2 
A ce A ae 
ISw'sll=| 5° yf lSw‘el=] | 
0 1 0 2 
(1.24) || Sou“ || ad 1 0 ’ || Sis*z || * 1 0 bd 
0 | 9 
A a A — 
|| Soa B || sont 0\|’ || Sos a|| 0 Re | ’ 























and hence in this coérdinate system 
Sie = —1iSs ’ Sis = 1Se ; and Si => 1So3 . 


However, these equations may be written as 


& 3 
1.25 oe -. pror = my 
(1.25) rasa f=, 
where g = | g.;| (= —1 ina preferred codrdinate system) and e”” is the anti- 


symmetric tensor which is equal to zero unless all indices are different and then 
it is equal to +1 or —1 if uvor is an even or odd permutation of 1234. Since 
equations (1.25) are spinor equations, they hold in all coérdinate systems. 

From equations (1.24) and (1.4) the following equations may be proved by 
verifying them in the special coérdinate system, 


(1.26) SP", Sera = —2nh', 
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(1.27) S”s9)°c = — 7940, 
and 
(1.28) go BSc = mrg’*c, 
where 

1 
1.29 = 8 + — og, 
(1.29) n Vg * ede 


and 6s, is a generalized Kronecker delta, 52; = 426; — 6%; . 
From equation (1.25) it is evident that the tensor defined by the equations 


(1.30) Zw = wank, 

where F4” is a symmetric spinor is self dual, that is, 4, = z.. Multiplying 
equation (1.30) by S”°” and summing, we obtain as a consequence of equa- 
tions (1.20) 

(1.31) F°? = 42,8”. 


Equations (1.30) and (1.31) give explicitly the (1-1) correspondence between 
symmetric spinors and self dual tensors. Since equations (1.31) are inverse to 
(1.30) the latter must be identically satisfied when the former are substituted 
into them. Using this fact we obtain equations (1.26). This gives an alternate 
proof of that identity. 

Consider the determinant of the symmetric spinor F4, , we have 


(1.32) | Fas | I 41F“"F 4, = peter2”. 


Hence F 4 is singular if and only if z.,2°" = 0. Since any symmetric spinor 
may be written as 


(1.33) Fas = 3(Wayen + Papa) 
we have that 4 = py, if and only if z,,z”° = 0. Hence a self dual tensor deter- 
mines a simple spinor if and only if 2.2" = 0. 

We conclude this section by noting that an arbitrary real antisymmetric 
tensor f,, determines a self dual tensor by the relations 
(1.34) Zu» = Sw + le = $ (Zp + Zu) + 3 (Zur — 2p). 


Since j,, is imaginary, if f,, is real we see that a self dual antisymmetric tensor 
always determines a real antisymmetric tensor by the relations 


(1.35) Suv = 3 (Zur + 2). 
It may be proved that 
(1.36) 42 ane = I Se + AS Sands - 


If f,, is the antisymmetric tensor corresponding to an electromagnetic field, the 
right hand side of (1.36) is the expression for the stress energy tensor of this field. 
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2. Tensors Determined by the Solutions of the Dirac Equations 


In the preceding section we considered coérdinate transformations in the spin 
space and those in the space S, as unrelated. From this viewpoint the in- 
variance in form of the Dirac equations under arbitrary coérdinate transforma- 
tions and constant spin transformations follows immediately. (The restriction 
to constant spin transformations may be removed by replacing the ordinary 
derivative by a covariant derivative: notes Chapter III.) The Dirac equa- 
tions, however, have another invariance property, namely, the coefficients a 
are numerically invariant under an arbitrary Lorentz transformation if we 
replace y4 and ¢g4 by suitable linear combinations. This type of transforma- 
tion and the corresponding Lorentz transformation will be called geometric 
transformations and may be distinguished from codrdinate transformations by 
considering them as a permutation of the points of the space in contrast to a 
renaming of their codrdinates. 

This property of the Dirac equations is due to the isomorphism between the 
antiprojective group in the spin-space and the Lorentz group in S, which may 
be stated as follows: 

For every proper Lorentz transformation with coefficients L’, in S, there are 
exactly two unimodular matrices P = || P“s || and —P which satisfy 


(2.1) g = Py PL’, 


and for every improper Lorentz transformation there are exactly two unimodular 
matrices P = || P*s || and —P which satisfy 


(2.2) g = Pg P'L’,. 


In both cases the matrices P determine the Lorentz transformation uniquely 
and give a double valued representation of the Lorentz group. It should be 
noticed that the matrices of the type P = || P*s || determine a linear trans- 
formation of the form 


¢g* = Pg, 


A a 
|| P“z || determine an antilinear trans- 


whereas the matrices of the type P 
formation of the type 


eo = Pe. 


The numerical invariance of the coefficients of the Dirac equations under 
Lorentz transformations is readily proved by means of equations (2.1) and (2.2). 
Thus if we perform the proper Lorentz transformation 


x" pas L’,2", 





8 Notes page 19. 
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equations (1.5) become 


gL’ pew = —imee, 
g’L'.p; " —imey, 


where 
* h 0 @ ¢ , oe) 
pt = * (52, - Seer) and e.L’, = 5). 
If we now make the spin transformation 
(2.3) y*= Py, ¢g* = Pg, 


these equations become (1.5) in the new quantities. However, if L’, is an 
improper Lorentz transformation, we must make the substitution 


(2.4) y* ma P2, ¢* 5 Py, 


to regain equations (1.5). 

We now examine the following quantities determined by two simple spinors 
v, and ¢*: 
(2.5) T = V0. ? 


(2.6) AP=V iw’, B= egise’, C= VW G'isg’, 
and 
(2.7) Pp” am S” say y’, M” ie S” any" yp", R” oi Ss. ¢* y’. 


Under transformations of spin coérdinates these are all scalars. Under coérdi= 
nate transformations in S, they transform as tensors of the type indicated by 
their indices (7 is a scalar, A’ a vector, etc.). From equation (1.13) it is evi- 
dent that A, and B, are real; the remaining are complex, and P”, M*’, and R” 
are self dual antisymmetric tensors as follows from (1.25). A,,B,,C,, M.,, 
and R,, have been treated by Whittaker and Ruse has discussed P,, . 
Consider the behavior of these quantities under geometric transformations 
in the spin space: A transformation of the type (2.3) induces on these seven 
quantities the tensor transformation indicated by the indices. For example 


AM = PG iad” = Prod g’inP’ov” = L'd*g' av” = L’A’, 
where L’, is the proper Lorentz transformation determined by equations (2.1); 
since the fact that P is a unimodular matrix implies e4,P“cP”» = €cv whence 
Piz = —P3.. Similarly it can be shown that B, and C, transform as vectors, 
P,,, M., and R,, transform as antisymmetric tensors and T is a scalar. 


However, a transformation of the type (2.4) has a different effect on these 
quantities. Thus the antilinear transformation: 


* = 
y4 »'¢** oe pA p ¢" wag pe Py 
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sends ; 
ToT =" 95 = —Pae Pi be = 2's = -—T, 
aA a LS, CSP on LLP, 
(2.8) B’ > B* = LA’, M” > Mm’ = L’,LR”, 
Cc’ 3 OP = LC, R" = R* = 1" LM”, 


where L’, is now the improper Lorentz transformation determined by equa- 


tion (2.2). 

From these relations it is evident that the quantities 
(2.9) A=T -— fT, J° = A° + B’, 
and 


fer = Py, + FB, 


undergo a tensor transformation of the type indicated by their indices whenever 
y and ¢ undergo a transformation of the type (2.3) or (2.4). It is possible to 
form quantities which have an induced tensor transformation law out of com- 
binations of the quantities C’, M”, and R” and their conjugates. However, 
these are not gauge invariant quantities if y, and gy, are solutions of the Dirac 
equation, whereas those given by equation (2.9) are. This follows from the 
fact that if yg, is replaced by g, + 0S/dz’, the Dirac equations (1.1) are un- 
altered if we replace y* by eS y and ¢* by e “* 9*. Because C,, Mur, 
and R,, and their conjugates are not gauge invariant, neither they themselves 
nor quantities built from them by linear combinations can have a direct 
physical interpretation. However, gauge invariant quadratic combinations can 
be formed and these may have a physical interpretation. 

The seven quantities defined by equations (2.5), (2.6) and (2.7) are not all 
independent but satisfy identities some of which have been given by Ruse and 
Whittaker obtained from equations (1.16), (1.22) and (1.23). These are the 
same as the quadratic identities given by Pauli’ for the tensors built from four 
component spinors. In addition Whittaker’s “Catalytic’’ relations follow from 
the latter equations. 


3. Tensor Equations Equivalent to the Dirac Equations 
The tensor equations shown by Whittaker to be equivalent to the Dirac 
equations are 
1 


0’ = — 
2 


(M™ + R”), — i (M” — R”)o, 
Cc 
(3.1) 


— Weds + veeS)g” + ne (Cc + 0) = 0. 





* W. Pauli: ‘“‘Beitrage zur Mathematischen Theorie der Diracschen Matrizen,’”’ Zeeman 
Verhandelingen, Nijhoff (1935). 
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The quantity Q’ transforms as a contravariant vector under arbitrary coérdi- 
nate transformations and as a scalar under constant spin coérdinate trans- 
formations. A constant geometric transformation of the type (2.3) in the spin 
spaces sends 


a Q”* a L’ an’ 
and, one of the tvpe (2.4) sends 
Oo > a” as ES mig 


provided we perform the corresponding Lorentz transformation in S,. Hence 
equation (3.4) does not as it stands have an induced vector transformation 
law. However, it is equivalent to a pair of equations which do, since 2" = 0 
is equivalent to the two statements 

R= 40° + &) = 0, 
and 

r=i@-m <=0. 

27 

R’ has the induced transformation law of a contravariant vector and J" has the 
induced transformation law of a pseudo vector, that is, of the dual third order 
antisymmetric tensor associated with every vector. 

As was noted before, the quantities which enter into equations (3.1) are not 
gauge invariant and therefore cannot have a physical interpretation. Ruse 
obtained another set of equations, which do have a physical interpretation and 
which are equivalent to the Dirac equations, they are the real and imaginary 
parts of A’ set equal to zero where 
(3.2) A’ = _" Pre +9" (gepew” + ¥° Bove) + imed". 
The transformation properties of A” are the same as those of 2”. 


4, Maxwell’s Equations in Spinor Form 


Laporte and Uhlenbeck* were the first to use the (1-1) correspondence be- 
tween self dual tensors and symmetric spinors to obtain the spinor form of the 
Maxwell equations. The formalism of the first section enables us to obtain 
these results quite readily. We give them here so as to compare them with the 
equations recently proposed by Dirac‘ and used by Kemmer’ for material 
particles with spin one. 

The Maxwell equations in free space may be written as 


(4.1) 2, = 1 


where 2” is a self dual complex antisymmetric tensor and J’ is a timelike real 
vector, the current vector. In terms of the four potential vector ¢, we have 


(4.2) Zer = anor (Ou, rid Pru) 
































































946 A. H. TAUB 


If we write 
Zer = PawSox” and ge = go’ "Pin, 
equations (4.2) become on multiplying by S”’cp 
8F en = 28"cr((gi4"bis).» — (9\*ba8) n)s 
which may be written as 


(4.3) Fep = —3(9"4 pbc» + 94 Pinw), 


where we have used equation (1.22). 
Equations (4.1) may be written as 


oF as. an P eee PO ; 
Multiplying this by g.¢p and using (1.22), we obtain 
(4.4) gn F*? = 1°”. 


Equation (4.4) is fully equivalent to equation (4.1) since the latter may be 
derived from the former by using the formulas from the latter part of the first 
section. 

The equations proposed by Dirac for a free particle of spin 1 are in the nota- 
tion used here 


“en a KB*¢, 
(4.5) 9° icB4p,« = —KA CD y 


me 


K=-7, 


where m = mass of the particle and Agc is a symmetric spinor. It is readily 
verified that as a consequence of equations (4.5) both Asc and B4, satisfy second 
order wave equations. We see that aside from constant factors equations (4.5) 


are exactly the same as (4.3) and (4.4) provided we identify Ij, and jg» in 
the latter two. 


We will now determine the tensor equations equivalent to (4.5). Since Asc 
is symmetric, we may write 


(4.6) Asc = TAS” ac and B4, - vec 


where A,, is a self dual antisymmetric tensor and ¢, is a covariant vector, which 
is real if Bic is Hermitian. The first of equations (4.5) becomes 


AB tA tA tA 
g° (4A pS’ 5c), = 39°’ Aw,oNerg c= g°’ Aor,eg Ac es Keg Cy 
or 


(4.7) A”, = Kg’. 


Th 








re 
st 


i- 






or 





(4.8) 





(4.9) 


which follows from (4.7). Multiplying equations (4.8) by S,,°” and summing, 


we obtain 


(4.10) 


Equations (4.7) and (4.10) together are equivalent to the set (4.5), since the 
latter may be derived from these equations in the manner in which we derived 
the spinor form of the Maxwell equations. 
from the equations proposed by Proca’ in that the antisymmetric self dual 
tensor A, is not the curl of a vector ¢, but only the self dual part of the curl 
If the vector were real, the equations (4.7), (4.9) and (4.10) 
could be decomposed into two sets, which would be exactly the Proca equa- 


of the vector. 


tions for real ¢, . 


The tensor equations equivalent to the spinor equations 


(4.11) 


which were proposed by Kemmer are 


(4.12) 


where B4” = gag 
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The second of equations (4.5) becomes 





o T. i K , 
g Acg “Der ~FZ Aw OD y 






S” co¢r,e + 9 €cv¢r,¢ So 






Hence we must have 











—KA, = Npr\¥r,0 . 





Equations (4.7) and (4.10) differ 














pcA = 2m’g, , 
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A CORRECTION 
R. P. Boas, Jr. 
(Received December 1, 1938) 


tA 

4 In my paper, Representations for entire functions of exponential type,’ I 
made the remark that a function f(z), of exponential type R, and bounded 

pa i on the real axis, can have the form 


je) = [ e*dato, 


ea) with a(t) not of the class Lip 0; but my assertion that “trigonometric poly- 
es) nomials furnish examples” is obviously incorrect, since Lip 0 is naturally to 
hy be interpreted as the class of bounded functions, not the class of continuous 
functions. A correct example is 





a/2 
f( = / sin zt d(log sin #) 
0+ 
2/2 
=| sin zt ctn tdt 
Q 


a/2 
= —Z [ log sin ¢ cos zt dt. 
0 
That f(x) is bounded for large |z| follows from the second integral, which is 
the Dirichlet integral for a function of bounded variation; the third integral 


shows that f(x) is bounded for small | x |. 


CAMBRIDGE, ENGLAND. 





1These Annals, 39 (1938), pp. 269-286; p. 284. 
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